IMPROVED REGRET BOUND AND EXPERIENCE REPLAY IN REGULARIZED POLICY ITERATION

Bibliographic details
Published in: International Conference on Machine Learning (38th : 2021 : Online) International Conference on Machine Learning (ICML 2021 ; Part 8 of 16
First author: LAZIC, NEVENA (author)
Other authors: YIN, DONG (author), ABBASI-YADKORI, YASIN (author), SZEPESVARI, CSABA (author)
Pages: 2021
Format: UnknownFormat
Language: eng
Published: 2022
Title Year Author
EVALUATING ROBUSTNESS OF PREDICTIVE UNCERTAINTY ESTIMATION: ARE DIRICHLET-BASED MODELS RELIABLE? 2022 KOPETZKI, ANNA-KATHRIN
BOOSTING THE THROUGHPUT AND ACCELERATOR UTILIZATION OF SPECIALIZED CNN INFERENCE BEYOND INCREASING BATCH SIZE 2022 KOSAIAN, JACK
ACTIVE TESTING: SAMPLE-EFFICIENT MODEL EVALUATION 2022 KOSSEN, JANNIK
BAYESIAN STRUCTURAL ADAPTATION FOR CONTINUAL LEARNING 2022 KUMAR, ABHISHEK
ADAPTIVE NEWTON SKETCH: LINEAR-TIME OPTIMIZATION WITH QUADRATIC CONVERGENCE AND EFFECTIVE HESSIAN DIMENSIONALITY 2022 LACOTTE, JONATHAN
DISCOVERING SYMBOLIC POLICIES WITH DEEP REINFORCEMENT LEARNING 2022 LANDAJUELA, MIKEL
COUNTSKETCHES, FEATURE HASHING AND THE MEDIAN OF THREE 2022 LARSEN, KASPER GREEN
BETTER TRAINING USING WEIGHT-CONSTRAINED STOCHASTIC DYNAMICS 2022 LEIMKUHLER, BENEDICT
GRADIENT DISAGGREGATION: BREAKING PRIVACY IN FEDERATED LEARNING BY RECONSTRUCTING THE USER PARTICIPANT MATRIX 2022 LAM, MAXIMILIAN
MORPHVAE: GENERATING NEURAL MORPHOLOGIES FROM 3D-WALKS USING A VARIATIONAL AUTOENCODER WITH SPHERICAL LATENT SPACE 2022 LATURNUS, SOPHIE
IMPROVED REGRET BOUND AND EXPERIENCE REPLAY IN REGULARIZED POLICY ITERATION 2022 LAZIC, NEVENA
GAUSSIAN PROCESS-BASED REAL-TIME LEARNING FOR SAFETY CRITICAL APPLICATIONS 2022 LEDERER, ARMIN
LEARNING TO PRICE AGAINST A MOVING TARGET 2022 LEME, RENATO PAES
GLOBALLY-ROBUST NEURAL NETWORKS 2022 LEINO, KLAS
SHARING LESS IS MORE: LIFELONG LEARNING IN DEEP NETWORKS WITH SELECTIVE LAYER TRANSFER 2022 LEE, SEUNGWON
UNSUPERVISED EMBEDDING ADAPTATION VIA EARLY-STAGE FEATURE RECONSTRUCTION FOR FEW-SHOT CLASSIFICATION 2022 LEE, DONG HOON
SCALABLE EVALUATION OF MULTI-AGENT REINFORCEMENT LEARNING WITH MELTING POT 2022 LEIBO, JOEL Z.
STRATEGIC CLASSIFICATION MADE PRACTICAL 2022 LEVANON, SAGI
WINOGRAD ALGORITHM FOR ADDERNET 2022 LI, WENSHUO
A LOWER BOUND FOR THE SAMPLE COMPLEXITY OF INVERSE REINFORCEMENT LEARNING 2022 KOMANDURU, ABI