LOGARITHMIC REGRET FOR REINFORCEMENT LEARNING WITH LINEAR FUNCTION APPROXIMATION

Bibliographic Details
Published in: International Conference on Machine Learning (38th : 2021 : Online), International Conference on Machine Learning (ICML 2021); Part 6 of 16
Main author: HE, JIAFAN (author)
Other authors: ZHOU, DONGRUO (author), GU, QUANQUAN (author)
Pages: 2021
Format: UnknownFormat
Language: eng
Published: 2022
Title Year Author
ADVERSARIAL COMBINATORIAL BANDITS WITH GENERAL NON-LINEAR REWARD FUNCTIONS 2022 CHEN, XI
EQUIVARIANT LEARNING OF STOCHASTIC FIELDS: GAUSSIAN PROCESSES AND STEERABLE CONDITIONAL NEURAL PROCESSES 2022 HOLDERRIETH, PETER
A NOVEL SEQUENTIAL CORESET METHOD FOR GRADIENT DESCENT ALGORITHMS 2022 HUANG, JIAWEI
FL-NTK: A NEURAL TANGENT KERNEL-BASED FRAMEWORK FOR FEDERATED LEARNING ANALYSIS 2022 HUANG, BAIHE
A RIEMANNIAN BLOCK COORDINATE DESCENT METHOD FOR COMPUTING THE PROJECTION ROBUST WASSERSTEIN DISTANCE 2022 HUANG, MINHUI
ACCURATE POST TRAINING QUANTIZATION WITH SMALL CALIBRATION SETS 2022 HUBARA, ITAY
MODEL PERFORMANCE SCALING WITH MULTIPLE DATA SOURCES 2022 HASHIMOTO, TATSUNORI
IMPROVING MOLECULAR GRAPH NEURAL NETWORK EXPLAINABILITY WITH ORTHONORMALIZATION AND INDUCED SPARSITY 2022 HENDERSON, RYAN
MUESLI: COMBINING IMPROVEMENTS IN POLICY OPTIMIZATION 2022 HESSEL, MATTEO
MULTIPLICATIVE NOISE AND HEAVY TAILS IN STOCHASTIC OPTIMIZATION 2022 HODGKINSON, LIAM
LEARNING AND PLANNING IN COMPLEX ACTION SPACES 2022 HUBERT, THOMAS
LIETRANSFORMER: EQUIVARIANT SELF-ATTENTION FOR LIE GROUPS 2022 HUTCHINSON, MICHAEL
INSTANCE-OPTIMAL COMPRESSED SENSING VIA POSTERIOR SAMPLING 2022 JALAL, AJIL
IMPROVED REGRET BOUNDS OF BILINEAR BANDITS USING ACTION SPACE ANALYSIS 2022 JANG, KYOUNGSEOK
OBJECTIVE BOUND CONDITIONAL GAUSSIAN PROCESS FOR BAYESIAN OPTIMIZATION 2022 JEONG, TAEWON
REGRET MINIMIZATION IN STOCHASTIC NON-CONVEX LEARNING VIA A PROXIMAL-GRADIENT APPROACH 2022 HALLAK, NADAV
PipeTransformer: Automated Elastic Pipelining for Distributed Training of Large-scale Models 2022 He, Chaoyang
THE LIMITS OF MIN-MAX OPTIMIZATION ALGORITHMS: CONVERGENCE TO SPURIOUS NON-CRITICAL SETS 2022 HSIEH, YA-PING
ON RECOVERING FROM MODELING ERRORS USING TESTING BAYESIAN NETWORKS 2022 HUANG, HAIYING
SCALABLE MARGINAL LIKELIHOOD ESTIMATION FOR MODEL SELECTION IN DEEP LEARNING 2022 IMMER, ALEXANDER