Global Convergence and Stability of Stochastic Gradient Descent | 2023 | Patel, Vivak |
Extrapolative Continuous-time Bayesian Neural Network for Fast Training-free Test-time Adaptation | 2023 | Huang, Hengguan |
Scaling Multimodal Pre-Training via Cross-Modality Gradient Harmonization | 2023 | Wu, Junru |
Regret Bounds for Risk-Sensitive Reinforcement Learning | 2023 | Bastani, Osbert |
Chain of Thought Imitation with Procedure Cloning | 2023 | Yang, Mengjiao (Sherry) |
Out-of-Distribution Detection via Conditional Kernel Independence Model | 2023 | Wang, Yu |
ResT V2: Simpler, Faster and Stronger | 2023 | Zhang, Qinglong |
Learning Partial Equivariances From Data | 2023 | Romero, David W. |
Data-Efficient Structured Pruning via Submodular Optimization | 2023 | El Halabi, Marwa |
Interpolation and Regularization for Causal Learning | 2023 | Chennuru Vankadara, Leena |
Fuzzy Learning Machine | 2023 | Cui, Junbiao |
Contextual Squeeze-and-Excitation for Efficient Few-Shot Image Classification | 2023 | Patacchiola, Massimiliano |
AMOS: A Large-Scale Abdominal Multi-Organ Benchmark for Versatile Medical Image Segmentation | 2023 | Ji, Yuanfeng |
Learning Structure from the Ground up—Hierarchical Representation Learning by Chunking | 2023 | Wu, Shuchen |
Masked Autoencoders As Spatiotemporal Learners | 2023 | Feichtenhofer, Christoph |
On the Parameterization and Initialization of Diagonal State Space Models | 2023 | Gu, Albert |
Rethinking Alignment in Video Super-Resolution Transformers | 2023 | Shi, Shuwei |
Learning to Scaffold: Optimizing Model Explanations for Teaching | 2023 | Fernandes, Patrick |
Robustness in deep learning: The good (width), the bad (depth), and the ugly (initialization) | 2023 | Zhu, Zhenyu |
Bounded-Regret MPC via Perturbation Analysis: Prediction Error, Constraints, and Nonlinearity | 2023 | Lin, Yiheng |