The Curse of Unrolling: Rate of Differentiating Through Optimization
|
2023 |
Scieur, Damien |
A Deep Learning Dataloader with Shared Data Preparation
|
2023 |
xie, jian |
Learning Physics Constrained Dynamics Using Autoencoders
|
2023 |
Yang, Tsung-Yen |
Peer Prediction for Learning Agents
|
2023 |
Feng, Shi |
Mirror Descent with Relative Smoothness in Measure Spaces, with application to Sinkhorn and EM
|
2023 |
Aubin-Frankowski, Pierre-Cyril |
APT-36K: A Large-scale Benchmark for Animal Pose Estimation and Tracking
|
2023 |
Yang, Yuxiang |
Locating and Editing Factual Associations in GPT
|
2023 |
Meng, Kevin |
Neural Topological Ordering for Computation Graphs
|
2023 |
Gagrani, Mukul |
Probable Domain Generalization via Quantile Risk Minimization
|
2023 |
Eastwood, Cian |
All Politics is Local: Redistricting via Local Fairness
|
2023 |
Ko, Shao-Heng |
Bayesian Risk Markov Decision Processes
|
2023 |
Lin, Yifan |
Biologically-plausible backpropagation through arbitrary timespans via local neuromodulators
|
2023 |
Liu, Yuhan Helena |
Mind the Gap: Understanding the Modality Gap in Multi-modal Contrastive Representation Learning
|
2023 |
Liang, Victor Weixin |
α-ReQ : Assessing Representation Quality in Self-Supervised Learning by measuring eigenspectrum decay
|
2023 |
Agrawal, Kumar K |
Operative dimensions in unconstrained connectivity of recurrent neural networks
|
2023 |
Krause, Renate |
Batch size-invariance for policy optimization
|
2023 |
Hilton, Jacob |
Variational Model Perturbation for Source-Free Domain Adaptation
|
2023 |
Jing, Mengmeng |
On the non-universality of deep learning: quantifying the cost of symmetry
|
2023 |
Abbe, Emmanuel |
Sharper Convergence Guarantees for Asynchronous SGD for Distributed and Federated Learning
|
2023 |
Koloskova, Anastasiia |
Visual Clues: Bridging Vision and Language Foundations for Image Paragraph Captioning
|
2023 |
Xie, Yujia |