SAMPLE EFFICIENT REINFORCEMENT LEARNING VIA LOW-RANK MATRIX ESTIMATION

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:NeurIPS (34. : 2020 : Online) 34th Conference on Neural Information Processing Systems (NeurIPS 2020) ; Volume 15 of 27
1. Verfasser: Shah, Devavrat (VerfasserIn)
Weitere Verfasser: Song, Dogyoon (VerfasserIn), Xu, Zhi (VerfasserIn), Yang, Yuzhe (VerfasserIn)
Pages:34
Format: UnknownFormat
Sprache:eng
Veröffentlicht: 2021
Schlagworte:
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Titel Jahr Verfasser
IN SEARCH OF ROBUST MEASURES OF GENERALIZATION 2021 Dziugaite, Gintare Karolina
ONLINE DECISION BASED VISUAL TRACKING VIA REINFORCEMENT LEARNING 2021 Song, Ke
LEARNING IMPLICIT CREDIT ASSIGNMENT FOR COOPERATIVE MULTI-AGENT REINFORCEMENT LEARNING 2021 Zhou, Meng
MATE: PLUGGING IN MODEL AWARENESS TO TASK EMBEDDING FOR META LEARNING 2021 Chen, Xiaohan
ROBUST AND HEAVY-TAILED MEAN ESTIMATION MADE SIMPLE, VIA REGRET MINIMIZATION 2021 Hopkins, Sam
COUNTEREXAMPLE-GUIDED LEARNING OF MONOTONIC NEURAL NETWORKS 2021 Sivaraman, Aishwarya
TRAINING GENERATIVE ADVERSARIAL NETWORKS WITH LIMITED DATA 2021 Karras, Tero
FRACTRAIN: FRACTIONALLY SQUEEZING BIT SAVINGS BOTH TEMPORALLY AND SPATIALLY FOR EFFICIENT DNN TRAINING 2021 Fu, Yonggan
PROJECTION EFFICIENT SUBGRADIENT METHOD AND OPTIMAL NONSMOOTH FRANK-WOLFE METHOD 2021 Thekumparampil, Kiran K.
YOUR GAN IS SECRETLY AN ENERGY-BASED MODEL AND YOU SHOULD USE DISCRIMINATOR DRIVEN LATENT SAMPLING 2021 Che, Tong
ADVERSARIAL SOFT ADVANTAGE FITTING: IMITATION LEARNING WITHOUT POLICY OPTIMIZATION 2021 Barde, Paul
AGRFEE TO DISAGREE: ADAPTIVE ENSEMBLE KNOWLEDGE DISTILLATION IN GRADIENT SPACE 2021 Du, Shangchen
THE WASSERSTEIN PROXIMAL GRADIENT ALGORITHM 2021 Salim, Adıl
UNIVERSALLY QUANTIZED NEURAL COMPRESSION 2021 Agustsson, Eirikur
OFF-POLICY IMITATION LEARNING FROM OBSERVATIONS 2021 Zhu, Zhuangdi
A MAXIMUM-ENTROPY APPROACH TO OFF-POLICY EVALUATION IN AVERAGE-REWARD MDPS 2021 Lazic, Nevena
ADAPTIVE LEARNED BLOOM FILTER (ADA-BF): EFFICIENT UTILIZATION OF THE CLASSIFIER WITH APPLICATION TO REAL-TIME INFORMATION FILTERING ON THE WEB 2021 Dai, Zhenwei
MCUNET: TINY DEEP LEARNING ON IOT DEVICES 2021 Lin, Ji
PROVABLY EFFICIENT REWARD-AGNOSTIC NAVIGATION WITH LINEAR VALUE ITERATION 2021 Zanette, Andrea
CSI: NOVELTY DETECTION VIA CONTRASTIVE LEARNING ON DISTRIBUTIONALLY SHIFTED INSTANCES 2021 Tack, Jihoon
Alle Artikel auflisten