Trust Region Policy Optimization with Optimal Transport Discrepancies: Duality and Algorithm for Continuous Actions

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:NeurIPS (36. : 2022 : New Orleans, La.; Online) 36th Conference on Neural Information Processing Systems (NeurIPS 2022 ; Volume 26 of 50
1. Verfasser: Terpin, Antonio (VerfasserIn)
Weitere Verfasser: Lanzetti, Nicolas (VerfasserIn), Yardim, Batuhan (VerfasserIn), Dorfler, Florian (VerfasserIn), Ramponi, Giorgia (VerfasserIn)
Pages:36
Format: UnknownFormat
Sprache:eng
Veröffentlicht: 2023
Schlagworte:
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Titel Jahr Verfasser
Bayesian Clustering of Neural Spiking Activity Using a Mixture of Dynamic Poisson Factor Analyzers 2023 Wei, Ganchao
Neural Estimation of Submodular Functions with Applications to Differentiable Subset Selection 2023 De, Abir
Maximum Class Separation as Inductive Bias in One Matrix 2023 Kasarla, Tejaswi
Mean Estimation in High-Dimensional Binary Markov Gaussian Mixture Models 2023 Zhang, Yihan
Online Frank-Wolfe with Arbitrary Delays 2023 Wan, Yuanyu
Trust Region Policy Optimization with Optimal Transport Discrepancies: Duality and Algorithm for Continuous Actions 2023 Terpin, Antonio
Low-Rank Modular Reinforcement Learning via Muscle Synergy 2023 Dong, Heng
Improving Barely Supervised Learning by Discriminating Unlabeled Samples with Super-Class 2023 Gui, Guan
Learn to Match with No Regret: Reinforcement Learning in Markov Matching Markets 2023 Min, Yifei
Large-Scale Retrieval for Reinforcement Learning 2023 Humphreys, Peter
TaSIL: Taylor Series Imitation Learning 2023 Pfrommer, Daniel
Multi-agent Dynamic Algorithm Configuration 2023 Xue, Ke
Continuous MDP Homomorphisms and Homomorphic Policy Gradient 2023 Rezaei-Shoshtari, Sahand
Learning to Follow Instructions in Text-Based Games 2023 Tuli, Mathieu
Surprising Instabilities in Training Deep Networks and a Theoretical Analysis 2023 Sun, Yuxin
Finite-Time Analysis of Adaptive Temporal Difference Learning with Deep Neural Networks 2023 Sun, Tao
A Geometric Perspective on Variational Autoencoders 2023 Chadebec, Clément
Graph Neural Networks with Adaptive Readouts 2023 Buterez, David
Tight Lower Bounds on Worst-Case Guarantees for Zero-Shot Learning with Attributes 2023 Mazzetto, Alessio
ELIAS: End-to-End Learning to Index and Search in Large Output Spaces 2023 Gupta, Nilesh
Alle Artikel auflisten