Inductive Biases and Variable Creation in Self-Attention Mechanisms
|
2023 |
Edelman, Benjamin L. |
Understanding Dataset Difficulty with V-Usable Information
|
2023 |
Ethayarajh, Kawin |
Head2Toe: Utilizing Intermediate Representations for Better Transfer Learning
|
2023 |
Evci, Utku |
An Equivalence Between Data Poisoning and Byzantine Gradient Attacks
|
2023 |
Farhadkhani, Sadegh |
Investigating Generalization by Controlling Normalized Margin
|
2023 |
Farhang, Alexander R. |
Matching Structure for Dual Learning
|
2023 |
Fei, Hao |
Kernelized Multiplicative Weights for 0/1-Polyhedral Games: Bridging the Gap Between Learning in Extensive-Form and Normal-Form Games
|
2023 |
Farina, Gabriele |
Principled Knowledge Extrapolation with GANs
|
2023 |
Feng, Ruili |
Coordinated Double Machine Learning
|
2023 |
Fingerhut, Nitai |
Conformal Prediction Sets with Limited False Positives
|
2023 |
Fisch, Adam |
Fast Relative Entropy Coding with A* Coding
|
2023 |
Flamich, Gergely |
Label Ranking Through Nonparametric Regression
|
2023 |
Fotakis, Dimitris |
DRIBO: Robust Deep Reinforcement Learning Via Multi-View Information Bottleneck
|
2023 |
Fan, Jiameng |
Training Discrete Deep Generative Models Via Gapped Straight-Through Estimator
|
2023 |
Fan, Ting-Han |
Variational Wasserstein Gradient Flow
|
2023 |
Fan, Jiaojiao |
Byzantine Machine Learning Made Easy by Resilient Averaging of Momentums
|
2023 |
Farhadkhani, Sadegh |
Bayesian Continuous-Time Tucker Decomposition
|
2023 |
Fang, Shikai |
Cascaded Gaps: Towards Logarithmic Regret for Risk-Sensitive Reinforcement Learning
|
2023 |
Fei, Yingjie |
A Resilient Distributed Boosting Algorithm
|
2023 |
Filmus, Yuval |
An Intriguing Property of Geophysics Inversion
|
2023 |
Feng, Yinan |