DIRECT POLICY GRADIENTS: DIRECT OPTIMIZATION OF POLICIES IN DISCRETE ACTION SPACES

Bibliographic Details
Published in: NeurIPS (34th : 2020 : Online) 34th Conference on Neural Information Processing Systems (NeurIPS 2020); Volume 22 of 27
Main author: Lorberbom, Guy (author)
Other authors: Maddison, Chris J. (author), Heess, Nicolas (author), Hazan, Tamir (author), Tarlow, Daniel (author)
Pages: 34
Format: UnknownFormat
Language: eng
Published: 2021
Title Year Author
PRUNING FILTER IN FILTER 2021 Meng, Fanxu
ARMA NETS: EXPANDING RECEPTIVE FIELD FOR DENSE PREDICTION 2021 Su, Jiahao
DIVERSITY-GUIDED MULTI-OBJECTIVE BAYESIAN OPTIMIZATION WITH BATCH EVALUATIONS 2021 Lukovic, Mina Konakovic
ON REWARD-FREE REINFORCEMENT LEARNING WITH LINEAR FUNCTION APPROXIMATION 2021 Wang, Ruosong
LEARNING OUTSIDE THE BLACK-BOX: THE PURSUIT OF INTERPRETABLE MODELS 2021 Crabbe, Jonathan
BREAKING REVERSIBILITY ACCELERATES LANGEVIN DYNAMICS FOR NON-CONVEX OPTIMIZATION 2021 Gao, Xuefeng
DIGRAPH INCEPTION CONVOLUTIONAL NETWORKS 2021 Tong, Zekun
FAIR MULTIPLE DECISION MAKING THROUGH SOFT INTERVENTIONS 2021 Hu, Yaowei
INVERSE LEARNING OF SYMMETRIES 2021 Wieser, Mario
EFFICIENT NONMYOPIC BAYESIAN OPTIMIZATION VIA ONE-SHOT MULTI-STEP TREES 2021 Jiang, Shali
HYBRID MODELS FOR LEARNING TO BRANCH 2021 Gupta, Prateek
WOODFISHER: EFFICIENT SECOND-ORDER APPROXIMATION FOR NEURAL NETWORK COMPRESSION 2021 Singh, Sidak Pal
LEARNING TO PROVE THEOREMS BY LEARNING TO GENERATE THEOREMS 2021 Wang, Mingzhe
TRUTHFUL DATA ACQUISITION VIA PEER PREDICTION 2021 Chen, Yiling
WHAT DID YOU THINK WOULD HAPPEN? EXPLAINING AGENT BEHAVIOUR THROUGH INTENDED OUTCOMES 2021 Yau, Herman
CONTINUOUS REGULARIZED WASSERSTEIN BARYCENTERS 2021 Li, Lingxiao
AXIOMS FOR LEARNING FROM PAIRWISE COMPARISONS 2021 Noothigattu, Ritesh
FEWER IS MORE: A DEEP GRAPH METRIC LEARNING PERSPECTIVE USING FEWER PROXIES 2021 Zhu, Yuehua
REPLICA-EXCHANGE NOSÉ-HOOVER DYNAMICS FOR BAYESIAN LEARNING ON LARGE DATASETS 2021 Luo, Rui
NEURAL ANISOTROPY DIRECTIONS 2021 Ortiz-Jimenez, Guillermo