GRADIENTDICE: RETHINKING GENERALIZED OFFLINE ESTIMATIONOF STATIONARY VALUES

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:International Conference on Machine Learning (37. : 2020 : Online) 37th International Conference on Machine Learning (ICML 2020) ; Part 15 of 15
1. Verfasser: Zhang, Shangtong (VerfasserIn)
Weitere Verfasser: Liu, Bo (VerfasserIn), Whiteson, Shimon (VerfasserIn)
Pages:37
Format: UnknownFormat
Sprache:eng
Veröffentlicht: 2021
Schlagworte:
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Titel Jahr Verfasser
TRAINING DEEP ENERGY-BASED MODELS WITH F-DIVERGENCE MINIMIZATION 2021 Yu, L.
DESIGNING OPTIMAL DYNAMIC TREATMENT REGIMES: A CAUSAL REINFORCEMENT LEARNING APPROACH 2021 Zhang, Junzhe
OPTIMAL ESTIMATOR FOR UNLABELED LINEAR REGRESSION 2021 Zhang, H.
SELF-ATTENTIVE HAWKES PROCESS 2021 Zhang, Qiang
ADAPTIVE REWARD-POISONING ATTACKS AGAINST REINFORCEMENT LEARNING 2021 Zhang, Xuezhou
SPARSIFIED LINEAR PROGRAMMING FOR ZERO-SUM EQUILIBRIUM FINDING 2021 Zhang, Brian Hu
FEATURE QUANTIZATION IMPROVES GAN TRAINING 2021 Zhao, Yang
DO RNN AND LSTM HAVE LONG MEMORY? 2021 Zhao, Jingyu
CAUSAL EFFECT ESTIMATION AND OPTIMAL DOSE SUGGESTIONS IN MOBILE HEALTH INTERNATIONAL CONFERENCE ON MACHINE LEARNING (ICML 2020) 2021 Zhu, Liangyu
LEARNING ADVERSARIALLY ROBUST REPRESENTATIONS VIA WORST-CASE MUTUAL INFORMATION MAXIMIZATION 2021 Zhu, Sicheng
LAPLACIAN REGULARIZED FEW-SHOT LEARNING 2021 Ziko, Imtiaz Masud
GRAPH CONVOLUTIONAL NETWORK FOR RECOMMENDATION WITH LOW-PASS COLLABORATIVE FILTERS 2021 Yu, Wenhui
GRAPH RANDOM NEURAL FEATURES FOR DISTANCE-PRESERVING GRAPH REPRESENTATIONS 2021 Zambon, Daniele
LEARNING NEAR OPTIMAL POLICIES WITH LOW INHERENT BELLMAN ERROR 2021 Zanette, A.
GENERATIVE ADVERSARIAL IMITATION LEARNING WITH NEURAL NETWORK PARAMETERIZATION: GLOBAL OPTIMALITY AND CONVERGENCE RATE 2021 Zhang, Y.
SPREAD DIVERGENCE 2021 Zhang, M.
MIX-N-MATCH: ENSEMBLE AND COMPOSITIONAL METHODS FOR UNCERTAINTY CALIBRATION IN DEEP LEARNING 2021 Zhang, Jize
GRADIENTDICE: RETHINKING GENERALIZED OFFLINE ESTIMATIONOF STATIONARY VALUES 2021 Zhang, Shangtong
PROVABLY CONVERGENT TWO-TIMESCALE OFF-POLICY ACTOR-CRITIC WITH FUNCTION APPROXIMATION 2021 Zhang, Shangtong
CONVEX CALIBRATED SURROGATES FOR THE MULTI-LABEL F-MEASURE 2021 Zhang, Mingyuan
Alle Artikel auflisten