Kantorovich Strikes Back! Wasserstein GANs are not Optimal Transport?
|
2023 |
Korotin, Alexander |
Distributional Convergence of the Sliced Wasserstein Process
|
2023 |
Xi, Jiaqi |
Training Scale-Invariant Neural Networks on the Sphere Can Happen in Three Regimes
|
2023 |
Kodryan, Maxim |
Multi-Scale Adaptive Network for Single Image Denoising
|
2023 |
Gou, Yuanbiao |
Graph Self-supervised Learning with Accurate Discrepancy Learning
|
2023 |
Kim, Dongki |
Hierarchical Normalization for Robust Monocular Depth Estimation
|
2023 |
Zhang, Chi |
Near-Optimal Collaborative Learning in Bandits
|
2023 |
Réda, Clémence |
Approximate Secular Equations for the Cubic Regularization Subproblem
|
2023 |
Gao, Yihang |
Knowledge-Aware Bayesian Deep Topic Model
|
2023 |
Wang, Dongsheng |
Variational inference via Wasserstein gradient flows
|
2023 |
Lambert, Marc |
Reinforcement Learning in a Birth and Death Process: Breaking the Dependence on the State Space
|
2023 |
Anselmi, Jonatha |
projUNN: efficient method for training deep networks with unitary matrices
|
2023 |
Kiani, Bobak |
Multi-dataset Training of Transformers for Robust Action Recognition
|
2023 |
Liang, Junwei |
Recipe for a General, Powerful, Scalable Graph Transformer
|
2023 |
Rampášek, Ladislav |
Pure Transformers are Powerful Graph Learners
|
2023 |
Kim, Jinwoo |
The First Optimal Algorithm for Smooth and Strongly-Convex-Strongly-Concave Minimax Optimization
|
2023 |
Kovalev, Dmitry |
Sparse Interaction Additive Networks via Feature Interaction Detection and Sparse Selection
|
2023 |
Enouen, James |
PALBERT: Teaching ALBERT to Ponder
|
2023 |
Balagansky, Nikita |
Analyzing Data-Centric Properties for Graph Contrastive Learning
|
2023 |
Trivedi, Puja |
Exploring the Whole Rashomon Set of Sparse Decision Trees
|
2023 |
Xin, Rui |