WHEN TO USE PARAMETRIC MODELS IN REINFORCEMENT LEARNING?

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:NeurIPS (33. : 2019 : Vancouver, British Columbia) 32nd Conference on Neural Information Processing Systems (NeurIPS 2019) ; Volume 18 of 20
1. Verfasser: Hasselt, Hado P. Van (VerfasserIn)
Weitere Verfasser: Hessel, Matteo (VerfasserIn), Aslanides, John (VerfasserIn)
Pages:32
Format: UnknownFormat
Sprache:eng
Veröffentlicht: 2020
Schlagworte:
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Titel Jahr Verfasser
RUDDER: RETURN DECOMPOSITION FOR DELAYED REWARDS 2020 Arjona-Medina, Jose A.
COMMUNICATION TRADE-OFFS FOR LOCAL-SGD WITH LARGE STEP SIZE 2020 Dieuleveut, Aymeric
EXPLANATIONS CAN BE MANIPULATED AND GEOMETRY IS TO BLAME 2020 Dombrowski, Ann-Kathrin
APPROXIMATING INTERACTIVE HUMAN EVALUATION WITH SELF-PLAY FOR OPEN-DOMAIN DIALOG SYSTEMS 2020 Ghandeharioun, Asma
LEARNING ABOUT AN EXPONENTIAL AMOUNT OF CONDITIONAL DISTRIBUTIONS 2020 Belghazi, Mohamed
RAND-NSG: FAST ACCURATE BILLION-POINT NEAREST NEIGHBOR SEARCH ON A SINGLE NODE 2020 Subramanya, Suhas Jayaram
LEARNING FAIRNESS IN MULTI-AGENT SYSTEMS 2020 Jiang, Jiechuan
PRIMAL-DUAL BLOCK GENERALIZED FRANK-WOLFE 2020 Lei, Qi
CALCULATING OPTIMISTIC LIKELIHOODS USING (GEODESICALLY) CONVEX OPTIMIZATION 2020 Nguyen, Viet Anh
CAN YOU TRUST YOUR MODEL'S UNCERTAINTY? EVALUATING PREDICTIVE UNCERTAINTY UNDER DATASET SHIFT 2020 Snoek, Jasper
USER-SPECIFIED LOCAL DIFFERENTIAL PRIVACY IN UNCONSTRAINED ADAPTIVE ONLINE LEARNING 2020 Hoeven, Dirk Van Der
USING A LOGARITHMIC MAPPING TO ENABLE LOWER DISCOUNT FACTORS IN REINFORCEMENT 2020 Seijen, Harm Van
LEARNING POSITIVE FUNCTIONS WITH PSEUDO MIRROR DESCENT 2020 Yang, Yingxiang
OUTLIER DETECTION AND ROBUST PCA USING A CONVEX MEASURE OF INNOVATION 2020 Rahmani, Mostafa
AN ALGORITHMIC FRAMEWORK FOR DIFFERENTIALLY PRIVATE DATA ANALYSIS ON TRUSTED PROCESSORS 2020 Allen, Joshua
TOWARDS HARDWARE-AWARE TRACTABLE LEARNING OF PROBABILISTIC MODELS 2020 Olascoaga, Laura I. Galindez
TOWARDS MODULAR AND PROGRAMMABLE ARCHITECTURE SEARCH 2020 Negrinho, Renato
USING EMBEDDINGS TO CORRECT FOR UNOBSERVED CONFOUNDING IN NETWORKS 2020 Veitch, Victor
ADVERSARIAL ROBUSTNESS THROUGH LOCAL LINEARIZATION 2020 Qin, Chongli
GOT: AN OPTIMAL TRANSPORT FRAMEWORK FOR GRAPH COMPARISON 2020 Maretic, Hermina Petric
Alle Artikel auflisten