IN SEARCH OF ROBUST MEASURES OF GENERALIZATION
|
2021 |
Dziugaite, Gintare Karolina |
ONLINE DECISION BASED VISUAL TRACKING VIA REINFORCEMENT LEARNING
|
2021 |
Song, Ke |
LEARNING IMPLICIT CREDIT ASSIGNMENT FOR COOPERATIVE MULTI-AGENT REINFORCEMENT LEARNING
|
2021 |
Zhou, Meng |
MATE: PLUGGING IN MODEL AWARENESS TO TASK EMBEDDING FOR META LEARNING
|
2021 |
Chen, Xiaohan |
ROBUST AND HEAVY-TAILED MEAN ESTIMATION MADE SIMPLE, VIA REGRET MINIMIZATION
|
2021 |
Hopkins, Sam |
COUNTEREXAMPLE-GUIDED LEARNING OF MONOTONIC NEURAL NETWORKS
|
2021 |
Sivaraman, Aishwarya |
TRAINING GENERATIVE ADVERSARIAL NETWORKS WITH LIMITED DATA
|
2021 |
Karras, Tero |
FRACTRAIN: FRACTIONALLY SQUEEZING BIT SAVINGS BOTH TEMPORALLY AND SPATIALLY FOR EFFICIENT DNN TRAINING
|
2021 |
Fu, Yonggan |
PROJECTION EFFICIENT SUBGRADIENT METHOD AND OPTIMAL NONSMOOTH FRANK-WOLFE METHOD
|
2021 |
Thekumparampil, Kiran K. |
YOUR GAN IS SECRETLY AN ENERGY-BASED MODEL AND YOU SHOULD USE DISCRIMINATOR DRIVEN LATENT SAMPLING
|
2021 |
Che, Tong |
ADVERSARIAL SOFT ADVANTAGE FITTING: IMITATION LEARNING WITHOUT POLICY OPTIMIZATION
|
2021 |
Barde, Paul |
AGREE TO DISAGREE: ADAPTIVE ENSEMBLE KNOWLEDGE DISTILLATION IN GRADIENT SPACE
|
2021 |
Du, Shangchen |
THE WASSERSTEIN PROXIMAL GRADIENT ALGORITHM
|
2021 |
Salim, Adil |
UNIVERSALLY QUANTIZED NEURAL COMPRESSION
|
2021 |
Agustsson, Eirikur |
OFF-POLICY IMITATION LEARNING FROM OBSERVATIONS
|
2021 |
Zhu, Zhuangdi |
A MAXIMUM-ENTROPY APPROACH TO OFF-POLICY EVALUATION IN AVERAGE-REWARD MDPS
|
2021 |
Lazic, Nevena |
ADAPTIVE LEARNED BLOOM FILTER (ADA-BF): EFFICIENT UTILIZATION OF THE CLASSIFIER WITH APPLICATION TO REAL-TIME INFORMATION FILTERING ON THE WEB
|
2021 |
Dai, Zhenwei |
MCUNET: TINY DEEP LEARNING ON IOT DEVICES
|
2021 |
Lin, Ji |
PROVABLY EFFICIENT REWARD-AGNOSTIC NAVIGATION WITH LINEAR VALUE ITERATION
|
2021 |
Zanette, Andrea |
CSI: NOVELTY DETECTION VIA CONTRASTIVE LEARNING ON DISTRIBUTIONALLY SHIFTED INSTANCES
|
2021 |
Tack, Jihoon |