OPTIMISTIC POLICY OPTIMIZATION WITH BANDIT FEEDBACK

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:International Conference on Machine Learning (37. : 2020 : Online) 37th International Conference on Machine Learning (ICML 2020) ; Part 12 of 15
1. Verfasser: Efroni, Y. (VerfasserIn)
Weitere Verfasser: Shani, L. (VerfasserIn), Rosenberg, A. (VerfasserIn), Mannor, S. (VerfasserIn)
Pages:37
Format: UnknownFormat
Sprache:eng
Veröffentlicht: 2021
Schlagworte:
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Titel Jahr Verfasser
EVALUATING MACHINE ACCURACY ON IMAGENET 2021 Shankar, V.
ADAPTIVE SAMPLING FOR ESTIMATING PROBABILITY DISTRIBUTIONS 2021 Shekhar, S.
CAUSAL STRATEGIC LINEAR REGRESSION 2021 Shavit, Yonadav
EDUCATING TEXT AUTOENCODERS: LATENT REPRESENTATION GUIDANCE VIA DENOISING 2021 Shen, T.
LANDSCAPE CONNECTIVITY AND DROPOUT STABILITY OF SGD SOLUTIONS FOR OVER- PARAMETERIZED NEURAL NETWORK 2021 Shevchenko, A.
INCREMENTAL SAMPLINGWITHOUT REPLACEMENT FOR SEQUENCE MODELS 2021 Shi, K.
DISTRIBUTIONALLY ROBUST POLICY EVALUATION AND LEARNING IN OFFLINE CONTEXTUAL BANDITS 2021 Si, Nian
SECOND-ORDER PROVABLE DEFENSES AGAINST ADVERSARIAL ATTACKS 2021 Singla, Sahil
STRUCTURED LINEAR CONTEXTUAL BANDITS: A SHARP AND GEOMETRIC SMOOTHED ANALYSIS 2021 Sivakumar, Vidyashankar
BRIDGING THE GAP BETWEEN F-GANS AND WASSERSTEIN GANS 2021 Song, Jiaming
WHICH TASKS SHOULD BE LEARNED TOGETHER IN MULTI-TASK LEARNING? 2021 Standley, Trevor
RESPONSIVE SAFETY IN REINFORCEMENT LEARNING BY PID LAGRANGIAN METHODS 2021 Stooke, Adam
LEARNING DISCRETE STRUCTURED REPRESENTATIONS BY ADVERSARIALLY MAXIMIZING MUTUAL INFORMATION 2021 Stratos, Karl
CONFIDENCE-CALIBRATED ADVERSARIAL TRAINING: GENERALIZING TO UNSEEN ATTACKS 2021 Stutz, David
TASK UNDERSTANDING FROM CONFUSING MULTI-TASK DATA 2021 Su, Xin
CONQUR: MITIGATING DELUSIONAL BIAS IN DEEP Q-LEARNING 2021 Su, DiJia (Andy)
MULTI-AGENT ROUTING VALUE ITERATION NETWORK 2021 Sykora, Quinlan
THE K-TIED NORMAL DISTRIBUTION: A COMPACT PARAMETERIZATION OF GAUSSIAN MEAN FIELD POSTERIORS IN BAYESIAN NEURAL NETWORKS 2021 Swiatkowski, Jakub
AN EXPLICITLY RELATIONAL NEURAL NETWORK ARCHITECTURE 2021 Shanahan, M.
CONTROLVAE: CONTROLLABLE VARIATIONAL AUTOENCODER 2021 Shao, H.
Alle Artikel auflisten