Small random initialization is akin to spectral learning: Optimization and generalization guarantees for overparameterized low-rank matrix reconstruction | 2022 | Stöger, Dominik |
Efficient Training of Visual Transformers with Small Datasets | 2022 | Liu, Yahui |
CoFiNet: Reliable Coarse-to-fine Correspondences for Robust Point Cloud Registration | 2022 | Yu, Hao |
LLC: Accurate, Multi-purpose Learnt Low-dimensional Binary Codes | 2022 | Kusupati, Aditya |
POODLE: Improving Few-shot Learning via Penalizing Out-of-Distribution Samples | 2022 | Le, Duong |
Efficiently Learning One Hidden Layer ReLU Networks From Queries | 2022 | Chen, Sitan |
Learning to Schedule Heuristics in Branch and Bound | 2022 | Chmiela, Antonia |
MLP-Mixer: An all-MLP Architecture for Vision | 2022 | Tolstikhin, Ilya O |
Communication-efficient SGD: From Local SGD to One-Shot Averaging | 2022 | Spiridonoff, Artin |
Can we globally optimize cross-validation loss? Quasiconvexity in ridge regression | 2022 | Stephenson, Will |
Discovering and Achieving Goals via World Models | 2022 | Mendonca, Russell |
Understanding Adaptive, Multiscale Temporal Integration In Deep Speech Recognition Systems | 2022 | Keshishian, Menoua |
Detecting Individual Decision-Making Style: Exploring Behavioral Stylometry in Chess | 2022 | McIlroy-Young, Reid |
Coupled Gradient Estimators for Discrete Latent Variables | 2022 | Dong, Zhe |
Instance-Dependent Bounds for Zeroth-order Lipschitz Optimization with Error Certificates | 2022 | Bachoc, Francois |
Memory Efficient Meta-Learning with Large Images | 2022 | Bronskill, John |
Fast and accurate randomized algorithms for low-rank tensor decompositions | 2022 | Ma, Linjian |
Structured in Space, Randomized in Time: Leveraging Dropout in RNNs for Efficient Training | 2022 | Sarma, Anup |
Recognizing Vector Graphics without Rasterization | 2022 | Jiang, Xinyang |
Active Offline Policy Selection | 2022 | Konyushova, Ksenia |