Small random initialization is akin to spectral learning: Optimization and generalization guarantees for overparameterized low-rank matrix reconstruction | | 2022 | Stöger, Dominik |
Efficient Training of Visual Transformers with Small Datasets | | 2022 | Liu, Yahui |
CoFiNet: Reliable Coarse-to-fine Correspondences for Robust PointCloud Registration | | 2022 | Yu, Hao |
LLC: Accurate, Multi-purpose Learnt Low-dimensional Binary Codes | | 2022 | Kusupati, Aditya |
POODLE: Improving Few-shot Learning via Penalizing Out-of-Distribution Samples | | 2022 | Le, Duong |
Efficiently Learning One Hidden Layer ReLU Networks From Queries | | 2022 | Chen, Sitan |
Learning to Schedule Heuristics in Branch and Bound | | 2022 | Chmiela, Antonia |
MLP-Mixer: An all-MLP Architecture for Vision | | 2022 | Tolstikhin, Ilya O |
Communication-efficient SGD: From Local SGD to One-Shot Averaging | | 2022 | Spiridonoff, Artin |
Can we globally optimize cross-validation loss? Quasiconvexity in ridge regression | | 2022 | Stephenson, Will |
Discovering and Achieving Goals via World Models | | 2022 | Mendonca, Russell |
Understanding Adaptive, Multiscale Temporal Integration In Deep Speech Recognition Systems | | 2022 | Keshishian, Menoua |
Detecting Individual Decision-Making Style: Exploring Behavioral Stylometry in Chess | | 2022 | McIlroy-Young, Reid |
Coupled Gradient Estimators for Discrete Latent Variables | | 2022 | Dong, Zhe |
MIRACLE: Causally-Aware Imputation via Learning Missing Data Mechanisms | | 2022 | Kyono, Trent |
Bias and variance of the Bayesian-mean decoder | | 2022 | Prat-Carrabin, Arthur |
Efficient Combination of Rematerialization and Offloading for Training DNNs | | 2022 | Beaumont, Olivier |
Analytic Insights into Structure and Rank of Neural Network Hessian Maps | | 2022 | Singh, Sidak Pal |
Well-tuned Simple Nets Excel on Tabular Datasets | | 2022 | Kadra, Arlind |
Fair Algorithms for Multi-Agent Multi-Armed Bandits | | 2022 | Hossain, Safwan |