ADAPTIVE TEMPORAL-DIFFERENCE LEARNING FOR POLICY EVALUATION WITH PER-STATE UNCERTAINTY ESTIMATES

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:NeurIPS (33. : 2019 : Vancouver, British Columbia) 32nd Conference on Neural Information Processing Systems (NeurIPS 2019) ; Volume 15 of 20
1. Verfasser: Riquelme, Carlos (VerfasserIn)
Weitere Verfasser: Penedones, Hugo (VerfasserIn), Vincent, Damien (VerfasserIn), Maennel, Hartmut (VerfasserIn), Gelly, Sylvain (VerfasserIn), Mann, Timothy A. (VerfasserIn), Barreto, Andre (VerfasserIn), Neu, Gergely (VerfasserIn)
Pages:32
Format: UnknownFormat
Sprache:eng
Veröffentlicht: 2020
Schlagworte:
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Titel Jahr Verfasser
MACHINE TEACHING OF ACTIVE SEQUENTIAL LEARNERS 2020 Peltola, Tomi
UNLABELED DATA IMPROVES ADVERSARIAL ROBUSTNESS 2020 Carmon, Yair
META ARCHITECTURE SEARCH 2020 Shaw, Albert
DISTRIBUTION OBLIVIOUS, RISK-AWARE ALGORITHMS FOR MULTI-ARMED BANDITS WITH UNBOUNDED REWARDS 2020 Kagrecha, Anmol
PROVABLY ROBUST DEEP LEARNING VIA ADVERSARIALLY TRAINED SMOOTHED CLASSIFIERS 2020 Salman, Hadi
VARIANCE REDUCTION FOR MATRIX GAMES 2020 Carmon, Yair
A DIRECT TILDE{O}(1/EPSILON) ITERATION PARALLEL ALGORITHM FOR OPTIMAL TRANSPORT 2020 Jambulapati, Arun
LEARNING NEURAL NETWORKS WITH ADAPTIVE REGULARIZATION 2020 Zhao, Han
PROVABLE NON-LINEAR INDUCTIVE MATRIX COMPLETION 2020 Zhong, Kai
COMMUNICATION-EFFICIENT DISTRIBUTED BLOCKWISE MOMENTUM SGD WITH ERROR-FEEDBACK 2020 Zheng, Shuai
IDENTIFICATION OF CONDITIONAL CAUSAL EFFECTS UNDER MARKOV EQUIVALENCE 2020 Jaber, Amin
NECESSARY AND SUFFICIENT GEOMETRIES FOR GRADIENT METHODS 2020 Levy, Daniel
LANDMARK ORDINAL EMBEDDING 2020 Ghosh, Nikhil
GLOBAL GUARANTEES FOR BLIND DEMODULATION WITH GENERATIVE PRIORS 2020 Hand, Paul
GEOMETRY-AWARE NEURAL RENDERING 2020 Tobin, Joshua
A ZERO-POSITIVE LEARNING APPROACH FOR DIAGNOSING SOFTWARE PERFORMANCE REGRESSIONS 2020 Alam, Mejbah
DTWNET: A DYNAMIC TIME WARPING NETWORK 2020 Cai, Xingyu
STRUCTURED GRAPH LEARNING VIA LAPLACIAN SPECTRAL CONSTRAINT 2020 Kumar, Sandeep
RETHINKING KERNEL METHODS FOR NODE REPRESENTATION LEARNING ON GRAPHS 2020 Tian, Yu
META-INVERSE REINFORCEMENT LEARNING WITH PROBABILISTIC CONTEXT VARIABLES 2020 Yu, Lantao
Alle Artikel auflisten