RL for Latent MDPs: Regret Guarantees and a Lower Bound

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:NeurIPS (35. : 2021 : Online) 35th Conference on Neural Information Processing Systems (NeurIPS 2021) ; Volume 29 of 36
1. Verfasser: Kwon, Jeongyeol (VerfasserIn)
Weitere Verfasser: Efroni, Yonathan (VerfasserIn), Caramanis, Constantine (VerfasserIn), Mannor, Shie (VerfasserIn)
Pages:35
Format: UnknownFormat
Sprache:eng
Veröffentlicht: 2022
Schlagworte:
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Titel Jahr Verfasser
Instance-Dependent Bounds for Zeroth-order Lipschitz Optimization with Error Certificates 2022 Bachoc, Francois
Memory Efficient Meta-Learning with Large Images 2022 Bronskill, John
Fast and accurate randomized algorithms for low-rank tensor decompositions 2022 Ma, Linjian
Structured in Space, Randomized in Time: Leveraging Dropout in RNNs for Efficient Training 2022 Sarma, Anup
Recognizing Vector Graphics without Rasterization 2022 JIANG, XINYANG
Active Offline Policy Selection 2022 Konyushova, Ksenia
Particle Cloud Generation with Message Passing Generative Adversarial Networks 2022 Kansal, Raghav
Densely connected normalizing flows 2022 Grcić, Matej
Subgame solving without common knowledge 2022 Zhang, Brian
VAST: Value Function Factorization with Variable Agent Sub-Teams 2022 Phan, Thomy
Multiwavelet-based Operator Learning for Differential Equations 2022 Gupta, Gaurav
Small random initialization is akin to spectral learning: Optimization and generalization guarantees for overparameterized low-rank matrix reconstruction 2022 Stöger, Dominik
Efficient Training of Visual Transformers with Small Datasets 2022 Liu, Yahui
CoFiNet: Reliable Coarse-to-fine Correspondences for Robust PointCloud Registration 2022 Yu, Hao
LLC: Accurate, Multi-purpose Learnt Low-dimensional Binary Codes 2022 Kusupati, Aditya
POODLE: Improving Few-shot Learning via Penalizing Out-of-Distribution Samples 2022 Le, Duong
Efficiently Learning One Hidden Layer ReLU Networks From Queries 2022 Chen, Sitan
Learning to Schedule Heuristics in Branch and Bound 2022 Chmiela, Antonia
MLP-Mixer: An all-MLP Architecture for Vision 2022 Tolstikhin, Ilya O
Communication-efficient SGD: From Local SGD to One-Shot Averaging 2022 Spiridonoff, Artin
Alle Artikel auflisten