Safe Policy Optimization with Local Generalized Linear Function Approximations

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:NeurIPS (35. : 2021 : Online) 35th Conference on Neural Information Processing Systems (NeurIPS 2021) ; Volume 25 of 36
1. Verfasser: Wachi, Akifumi (VerfasserIn)
Weitere Verfasser: Wei, Yunyue (VerfasserIn), Sui, Yanan (VerfasserIn)
Pages:35
Format: UnknownFormat
Sprache:eng
Veröffentlicht: 2022
Schlagworte:
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Titel Jahr Verfasser
On sensitivity of meta-learning to support data 2022 Agarwal, Mayank
Interesting Object, Curious Agent: Learning Task-Agnostic Exploration 2022 Parisi, Simone
SimiGrad: Fine-Grained Adaptive Batching for Large Scale Training using Gradient Similarity Measurement 2022 Qin, Heyang
STORM+: Fully Adaptive SGD with Recursive Momentum for Nonconvex Optimization 2022 Levy, Kfir
Safe Policy Optimization with Local Generalized Linear Function Approximations 2022 Wachi, Akifumi
Exponential Separation between Two Learning Models and Adversarial Robustness 2022 Gluch, Grzegorz
Efficient Online Estimation of Causal Effects by Deciding What to Observe 2022 Gupta, Shantanu
Perturbation Theory for the Information Bottleneck 2022 Ngampruetikorn, Vudtiwat
Deconvolutional Networks on Graph Data 2022 Li, Jia
Duplex Sequence-to-Sequence Learning for Reversible Machine Translation 2022 Zheng, Zaixiang
Provably Efficient Causal Reinforcement Learning with Confounded Observational Data 2022 Wang, Lingxiao
The Effect of the Intrinsic Dimension on the Generalization of Quadratic Classifiers 2022 Latorre, Fabian
BCORLE(λ): An Offline Reinforcement Learning and Evaluation Framework for Coupons Allocation in E-commerce Market 2022 Zhang, Yang
Across-animal odor decoding by probabilistic manifold alignment 2022 Herrero-Vidal, Pedro
Excess Capacity and Backdoor Poisoning 2022 Manoj, Naren
Exponential Bellman Equation and Improved Regret Bounds for Risk-Sensitive Reinforcement Learning 2022 Fei, Yingjie
On Large-Cohort Training for Federated Learning 2022 Charles, Zachary
Private learning implies quantum stability 2022 Quek, Yihui
The best of both worlds: stochastic and adversarial episodic MDPs with unknown transition 2022 Jin, Tiancheng
Variational Inference for Continuous-Time Switching Dynamical Systems 2022 Köhs, Lukas
Alle Artikel auflisten