RUDDER: RETURN DECOMPOSITION FOR DELAYED REWARDS
|
2020 |
Arjona-Medina, Jose A. |
COMMUNICATION TRADE-OFFS FOR LOCAL-SGD WITH LARGE STEP SIZE
|
2020 |
Dieuleveut, Aymeric |
EXPLANATIONS CAN BE MANIPULATED AND GEOMETRY IS TO BLAME
|
2020 |
Dombrowski, Ann-Kathrin |
APPROXIMATING INTERACTIVE HUMAN EVALUATION WITH SELF-PLAY FOR OPEN-DOMAIN DIALOG SYSTEMS
|
2020 |
Ghandeharioun, Asma |
LEARNING ABOUT AN EXPONENTIAL AMOUNT OF CONDITIONAL DISTRIBUTIONS
|
2020 |
Belghazi, Mohamed |
RAND-NSG: FAST ACCURATE BILLION-POINT NEAREST NEIGHBOR SEARCH ON A SINGLE NODE
|
2020 |
Subramanya, Suhas Jayaram |
LEARNING FAIRNESS IN MULTI-AGENT SYSTEMS
|
2020 |
Jiang, Jiechuan |
PRIMAL-DUAL BLOCK GENERALIZED FRANK-WOLFE
|
2020 |
Lei, Qi |
CALCULATING OPTIMISTIC LIKELIHOODS USING (GEODESICALLY) CONVEX OPTIMIZATION
|
2020 |
Nguyen, Viet Anh |
CAN YOU TRUST YOUR MODEL'S UNCERTAINTY? EVALUATING PREDICTIVE UNCERTAINTY UNDER DATASET SHIFT
|
2020 |
Snoek, Jasper |
USER-SPECIFIED LOCAL DIFFERENTIAL PRIVACY IN UNCONSTRAINED ADAPTIVE ONLINE LEARNING
|
2020 |
Hoeven, Dirk Van Der |
USING A LOGARITHMIC MAPPING TO ENABLE LOWER DISCOUNT FACTORS IN REINFORCEMENT
|
2020 |
Seijen, Harm Van |
LEARNING POSITIVE FUNCTIONS WITH PSEUDO MIRROR DESCENT
|
2020 |
Yang, Yingxiang |
OUTLIER DETECTION AND ROBUST PCA USING A CONVEX MEASURE OF INNOVATION
|
2020 |
Rahmani, Mostafa |
ALLEVIATING LABEL SWITCHING WITH OPTIMAL TRANSPORT
|
2020 |
Monteiller, Pierre |
PARAPHRASE GENERATION WITH LATENT BAG OF WORDS
|
2020 |
Fu, Yao |
A NEW DISTRIBUTION ON THE SIMPLEX WITH AUTO-ENCODING APPLICATIONS
|
2020 |
Stirn, Andrew |
AUTOPRUNE: AUTOMATIC NETWORK PRUNING BY REGULARIZING AUXILIARY PARAMETERS
|
2020 |
Xiao, Xia |
A NEURALLY PLAUSIBLE MODEL LEARNS SUCCESSOR REPRESENTATIONS IN PARTIALLY OBSERVABLE ENVIROMENTS
|
2020 |
Vértes, Eszter |
ON ROBUSTNESS TO ADVERSARIAL EXAMPLES AND POLYNOMIAL OPTIMIZATION
|
2020 |
Awasthi, Pranjal |