Policy Finetuning: Bridging Sample-Efficient Offline and Online Reinforcement Learning

Bibliographic Details
Published in: NeurIPS (35th : 2021 : Online) 35th Conference on Neural Information Processing Systems (NeurIPS 2021); Volume 33 of 36
Main author: Xie, Tengyang (author)
Other authors: Jiang, Nan (author), Wang, Huan (author), Xiong, Caiming (author), Bai, Yu (author)
Pages: 35
Format: UnknownFormat
Language: English
Published: 2022
Title Year Author
Improving Generalization in Meta-RL with Imaginary Tasks from Latent Dynamics Mixture 2022 Lee, Suyoung
Localization with Sampling-Argmax 2022 Li, Jiefeng
An analysis of Ermakov-Zolotukhin quadrature using kernels 2022 Belhadji, Ayoub
Robust Contrastive Learning Using Negative Samples with Diminished Semantics 2022 Ge, Songwei
Gauge Equivariant Transformer 2022 He, Lingshen
SGD: The Role of Implicit Regularization, Batch-size and Multiple-epochs 2022 Sekhari, Ayush
A Non-commutative Extension of Lee-Seung's Algorithm for Positive Semidefinite Factorizations 2022 Soh, Yong Sheng
Instance-Conditioned GAN 2022 Casanova, Arantxa
A Gaussian Process-Bayesian Bernoulli Mixture Model for Multi-Label Active Learning 2022 Shi, Weishi
A Unified View of cGANs with and without Classifiers 2022 Chen, Si-An
Local Hyper-Flow Diffusion 2022 Fountoulakis, Kimon
Faster Algorithms and Constant Lower Bounds for the Worst-Case Expected Error 2022 Brown-Cohen, Jonah
Dueling Bandits with Adversarial Sleeping 2022 Saha, Aadirupa
Robust Predictable Control 2022 Eysenbach, Ben
Spatial-Temporal Super-Resolution of Satellite Imagery via Conditional Pixel Synthesis 2022 He, Yutong
Garment4D: Garment Reconstruction from Point Cloud Sequences 2022 Hong, Fangzhou
You Are the Best Reviewer of Your Own Papers: An Owner-Assisted Scoring Mechanism 2022 Su, Weijie
Fast Policy Extragradient Methods for Competitive Games with Entropy Regularization 2022 Cen, Shicong
Dynamical Wasserstein Barycenters for Time-series Modeling 2022 Cheng, Kevin
Improved Regularization and Robustness for Fine-tuning in Neural Networks 2022 Li, Dongyue