Policy Finetuning: Bridging Sample-Efficient Offline and Online Reinforcement Learning

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	NeurIPS (35. : 2021 : Online) 35th Conference on Neural Information Processing Systems (NeurIPS 2021) ; Volume 33 of 36
1. Verfasser:	Xie, Tengyang (VerfasserIn)
Weitere Verfasser:	Jiang, Nan (VerfasserIn), Wang, Huan (VerfasserIn), Xiong, Caiming (VerfasserIn), Bai, Yu (VerfasserIn)
Pages:	35
Format:	UnknownFormat
Sprache:	eng
Veröffentlicht:	2022
Schlagworte:
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Bestellen

Titel	Jahr	Verfasser
Improving Generalization in Meta-RL with Imaginary Tasks from Latent Dynamics Mixture	2022	Lee, Suyoung
Localization with Sampling-Argmax	2022	Li, Jiefeng
An analysis of Ermakov-Zolotukhin quadrature using kernels	2022	Belhadji, Ayoub
Robust Contrastive Learning Using Negative Samples with Diminished Semantics	2022	Ge, Songwei
Gauge Equivariant Transformer	2022	He, Lingshen
SGD: The Role of Implicit Regularization, Batch-size and Multiple-epochs	2022	Sekhari, Ayush
A Non-commutative Extension of Lee-Seung' s Algorithm for Positive Semidefinite Factorizations	2022	Soh, Yong Sheng
Instance-Conditioned GAN	2022	Casanova, Arantxa
A Gaussian Process-Bayesian Bernoulli Mixture Model for Multi-Label Active Learning	2022	Shi, Weishi
A Unified View of cGANs with and without Classifiers	2022	Chen, Si-An
Local Hyper-Flow Diffusion	2022	Fountoulakis, Kimon
Faster Algorithms and Constant Lower Bounds for the Worst-Case Expected Error	2022	Brown-Cohen, Jonah
Dueling Bandits with Adversarial Sleeping	2022	Saha, Aadirupa
Robust Predictable Control	2022	Eysenbach, Ben
Spatial-Temporal Super-Resolution of Satellite Imagery via Conditional Pixel Synthesis	2022	He, Yutong
Garment4D: Garment Reconstruction from Point Cloud Sequences	2022	Hong, Fangzhou
You Are the Best Reviewer of Your Own Papers: An Owner-Assisted Scoring Mechanism	2022	Su, Weijie
Fast Policy Extragradient Methods for Competitive Games with Entropy Regularization	2022	Cen, Shicong
Dynamical Wasserstein Barycenters for Time-series Modeling	2022	Cheng, Kevin
Improved Regularization and Robustness for Fine-tuning in Neural Networks	2022	Li, Dongyue

Alle Artikel auflisten