PROVABLY EFFICIENT REINFORCEMENT LEARNING WITH KERNEL AND NEURAL FUNCTION APPROXIMATIONS

Bibliographic Details
Published in: NeurIPS (34th : 2020 : Online) 34th Conference on Neural Information Processing Systems (NeurIPS 2020) ; Volume 17 of 27
Main Author: Yang, Zhuoran (Author)
Other Authors: Jin, Chi (Author), Wang, Zhaoran (Author), Wang, Mengdi (Author), Jordan, Michael (Author)
Pages: 34
Format: UnknownFormat
Language: English
Published: 2021
Title Year Author
PC-PG: POLICY COVER DIRECTED EXPLORATION FOR PROVABLE POLICY GRADIENT LEARNING 2021 Agarwal, Alekh
GROUP CONTEXTUAL ENCODING FOR 3D POINT CLOUDS 2021 Liu, Xu
IS NORMALIZATION INDISPENSABLE FOR TRAINING DEEP NEURAL NETWORK? 2021 Shao, Jie
VARGRAD: A LOW-VARIANCE GRADIENT ESTIMATOR FOR VARIATIONAL INFERENCE 2021 Richter, Lorenz
MEMORY-EFFICIENT LEARNING OF STABLE LINEAR DYNAMICAL SYSTEMS FOR PREDICTION AND CONTROL 2021 Mamakoukas, Giorgos
NEUROSYMBOLIC TRANSFORMERS FOR MULTI-AGENT COMMUNICATION 2021 Inala, Jeevana Priya
CASPR: LEARNING CANONICAL SPATIOTEMPORAL POINT CLOUD REPRESENTATIONS 2021 Rempe, Davis
THE POTTS-ISING MODEL FOR DISCRETE MULTIVARIATE DATA 2021 Razaee, Zahra
UNDERSTANDING GRADIENT CLIPPING IN PRIVATE SGD: A GEOMETRIC PERSPECTIVE 2021 Chen, Xiangyi
LEARNING WITH OPERATOR-VALUED KERNELS IN REPRODUCING KERNEL KREIN SPACES 2021 Saha, Akash
CONSTANT-EXPANSION SUFFICES FOR COMPRESSED SENSING WITH GENERATIVE PRIORS 2021 Daskalakis, Constantinos
LEARNING SPARSE CODES FROM COMPRESSED REPRESENTATIONS WITH BIOLOGICALLY PLAUSIBLE LOCAL WIRING CONSTRAINTS 2021 Fallah, Kion
USING NOISE TO PROBE RECURRENT NEURAL NETWORK STRUCTURE AND PRUNE SYNAPSES 2021 Moore, Eli
INSTANCE-OPTIMALITY IN DIFFERENTIAL PRIVACY VIA APPROXIMATE INVERSE SENSITIVITY MECHANISMS 2021 Asi, Hilal
EFFICIENT MODEL-BASED REINFORCEMENT LEARNING THROUGH OPTIMISTIC POLICY SEARCH AND PLANNING 2021 Curi, Sebastian
PRACTICAL LOW-RANK COMMUNICATION COMPRESSION IN DECENTRALIZED DEEP LEARNING 2021 Vogels, Thijs
LATENT BANDITS REVISITED 2021 Hong, Joey
LINEAR TIME SINKHORN DIVERGENCES USING POSITIVE FEATURES 2021 Scetbon, Meyer
ADVERSARIAL COUNTERFACTUAL LEARNING AND EVALUATION FOR RECOMMENDER SYSTEM 2021 Xu, Da
EFFICIENT LEARNING OF DISCRETE GRAPHICAL MODELS 2021 Vuffray, Marc