VARIANCE-REDUCED OFF-POLICY TDC LEARNING: NON-ASYMPTOTIC CONVERGENCE ANALYSIS

Bibliographic details
Published in: NeurIPS (34th : 2020 : Online) 34th Conference on Neural Information Processing Systems (NeurIPS 2020); Volume 18 of 27
Main author: Ma, Shaocong (Author)
Other authors: Zhou, Yi (Author), Zou, Shaofeng (Author)
Pages: 34
Language: English
Published: 2021
Title Year Author
GRADAUG: A NEW REGULARIZATION METHOD FOR DEEP NEURAL NETWORKS 2021 Yang, Taojiannan
TRANSFER LEARNING VIA L1 REGULARIZATION 2021 Takada, Masaaki
LIFELONG POLICY GRADIENT LEARNING OF FACTORED POLICIES FOR FASTER TRAINING WITHOUT FORGETTING 2021 Mendez, Jorge
FAST GEOMETRIC LEARNING WITH SYMBOLIC MATRICES 2021 Feydy, Jean
PLANNING WITH GENERAL OBJECTIVE FUNCTIONS: GOING BEYOND TOTAL REWARDS 2021 Wang, Ruosong
ASSISTED LEARNING: A FRAMEWORK FOR MULTI-ORGANIZATION LEARNING 2021 Xian, Xun
ELECTION CODING FOR DISTRIBUTED LEARNING: PROTECTING SIGNSGD AGAINST BYZANTINE ATTACKS 2021 Sohn, Jy-Yong
TASK-ORIENTED FEATURE DISTILLATION 2021 Zhang, Linfeng
IMPLICIT RANK-MINIMIZING AUTOENCODER 2021 Jing, Li
ENTROPIC CAUSAL INFERENCE: IDENTIFIABILITY AND FINITE SAMPLE RESULTS 2021 Compton, Spencer
REWRITING HISTORY WITH INVERSE RL: HINDSIGHT INFERENCE FOR POLICY IMPROVEMENT 2021 Eysenbach, Ben
STEER : SIMPLE TEMPORAL REGULARIZATION FOR NEURAL ODE 2021 Ghosh, Arnab
RECURRENT SWITCHING DYNAMICAL SYSTEMS MODELS FOR MULTIPLE INTERACTING NEURAL POPULATIONS 2021 Glaser, Joshua
EFFICIENT CLUSTERING BASED ON A UNIFIED VIEW OF K-MEANS AND RATIO-CUT 2021 Pei, Shenfei
GENERALIZED INDEPENDENT NOISE CONDITION FOR ESTIMATING LATENT VARIABLE CAUSAL GRAPHS 2021 Xie, Feng
LEARNING TO SELECT BEST FORECAST TASKS FOR CLINICAL OUTCOME PREDICTION 2021 Xue, Yuan
STOCHASTIC OPTIMIZATION WITH HEAVY-TAILED NOISE VIA ACCELERATED GRADIENT CLIPPING 2021 Gorbunov, Eduard
GOCOR: BRINGING GLOBALLY OPTIMIZED CORRESPONDENCE VOLUMES INTO YOUR NEURAL NETWORK 2021 Truong, Prune
PREDICTION WITH CORRUPTED EXPERT ADVICE 2021 Amir, Idan
POINT PROCESS MODELS FOR SEQUENCE DETECTION IN HIGH-DIMENSIONAL NEURAL SPIKE TRAINS 2021 Williams, Alex