DISTRIBUTIONALLY ROBUST POLICY EVALUATION AND LEARNING IN OFFLINE CONTEXTUAL BANDITS

Bibliographic details
Published in: International Conference on Machine Learning (37th : 2020 : Online), 37th International Conference on Machine Learning (ICML 2020); Part 12 of 15
Main author: Si, Nian (author)
Other authors: Zhang, Fan (author), Zhou, Zhengyuan (author), Blanchet, Jose (author)
Pages: 37
Format: UnknownFormat
Language: English
Published: 2021
Title Year Author
OPTIMISTIC POLICY OPTIMIZATION WITH BANDIT FEEDBACK 2021 Efroni, Y.
NEURAL KERNELS WITHOUT TANGENTS 2021 Shankar, V.
LEARNING ROBOT SKILLS WITH TEMPORAL VARIATIONAL INFERENCE 2021 Shankar, Tanmay
POWERNORM: RETHINKING BATCH NORMALIZATION IN TRANSFORMERS 2021 Shen, Sheng
EXTREME MULTI-LABEL CLASSIFICATION FROM AGGREGATED LABELS 2021 Shen, Y.
MESSAGE PASSING LEAST SQUARES FRAMEWORK AND ITS APPLICATION TO ROTATION SYNCHRONIZATION 2021 Shi, Yunpeng
A GRAPH TO GRAPHS FRAMEWORK FOR RETROSYNTHESIS PREDICTION 2021 Shi, Chence
PREDICTIVE CODING FOR LOCALLY-LINEAR CONTROL 2021 Shu, Rui
A MARKOV DECISION PROCESS MODEL FOR SOCIO-ECONOMIC SYSTEMS IMPACTED BY CLIMATE CHANGE 2021 Shuvo, Salman Sadiq
LEARNING FAIR POLICIES IN MULTIOBJECTIVE (DEEP) REINFORCEMENT LEARNING WITH AVERAGE AND DISCOUNTED REWARDS 2021 Siddique, Umer
A GENERATIVE MODEL FOR MOLECULAR DISTANCE GEOMETRY 2021 Simm, Gregor N. C.
COLLABORATIVE MACHINE LEARNING WITH INCENTIVE-AWARE MODEL REWARDS 2021 Sim, Rachael Hwee Ling
ADAPTIVE ESTIMATOR SELECTION FOR OFF-POLICY EVALUATION 2021 Krishnamurthy, Akshay
TEST-TIME TRAINING WITH SELF-SUPERVISION FOR GENERALIZATION UNDER DISTRIBUTION SHIFTS 2021 Sun, Yu
THE MANY SHAPLEY VALUES FOR MODEL EXPLANATION 2021 Sundararajan, Mukund
FIEDLER REGULARIZATION: LEARNING NEURAL NETWORKS WITH GRAPH SPARSITY 2021 Tam, Edric
CHANNEL EQUILIBRIUM NETWORKS FOR LEARNING DEEP REPRESENTATION 2021 Shao, Wenqi
LOOKAHEAD-BOUNDED Q-LEARNING 2021 Shar, Ibrahim El
LEARNING FOR DOSE ALLOCATION IN ADAPTIVE CLINICAL TRIALS WITH SAFETY CONSTRAINTS 2021 Shen, Cong
ONE-SHOT DISTRIBUTED RIDGE REGRESSION IN HIGH DIMENSIONS 2021 Dobriban, Edgar