Trust Region Policy Optimization with Optimal Transport Discrepancies: Duality and Algorithm for Continuous Actions

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	NeurIPS (36. : 2022 : New Orleans, La.; Online) 36th Conference on Neural Information Processing Systems (NeurIPS 2022 ; Volume 26 of 50
1. Verfasser:	Terpin, Antonio (VerfasserIn)
Weitere Verfasser:	Lanzetti, Nicolas (VerfasserIn), Yardim, Batuhan (VerfasserIn), Dorfler, Florian (VerfasserIn), Ramponi, Giorgia (VerfasserIn)
Pages:	36
Format:	UnknownFormat
Sprache:	eng
Veröffentlicht:	2023
Schlagworte:
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Bestellen

Titel	Jahr	Verfasser
Bayesian Clustering of Neural Spiking Activity Using a Mixture of Dynamic Poisson Factor Analyzers	2023	Wei, Ganchao
Neural Estimation of Submodular Functions with Applications to Differentiable Subset Selection	2023	De, Abir
Maximum Class Separation as Inductive Bias in One Matrix	2023	Kasarla, Tejaswi
Mean Estimation in High-Dimensional Binary Markov Gaussian Mixture Models	2023	Zhang, Yihan
Online Frank-Wolfe with Arbitrary Delays	2023	Wan, Yuanyu
Trust Region Policy Optimization with Optimal Transport Discrepancies: Duality and Algorithm for Continuous Actions	2023	Terpin, Antonio
Low-Rank Modular Reinforcement Learning via Muscle Synergy	2023	Dong, Heng
Improving Barely Supervised Learning by Discriminating Unlabeled Samples with Super-Class	2023	Gui, Guan
Learn to Match with No Regret: Reinforcement Learning in Markov Matching Markets	2023	Min, Yifei
Large-Scale Retrieval for Reinforcement Learning	2023	Humphreys, Peter
TaSIL: Taylor Series Imitation Learning	2023	Pfrommer, Daniel
Multi-agent Dynamic Algorithm Configuration	2023	Xue, Ke
Continuous MDP Homomorphisms and Homomorphic Policy Gradient	2023	Rezaei-Shoshtari, Sahand
Learning to Follow Instructions in Text-Based Games	2023	Tuli, Mathieu
Surprising Instabilities in Training Deep Networks and a Theoretical Analysis	2023	Sun, Yuxin
Finite-Time Analysis of Adaptive Temporal Difference Learning with Deep Neural Networks	2023	Sun, Tao
A Geometric Perspective on Variational Autoencoders	2023	Chadebec, Clément
Graph Neural Networks with Adaptive Readouts	2023	Buterez, David
Tight Lower Bounds on Worst-Case Guarantees for Zero-Shot Learning with Attributes	2023	Mazzetto, Alessio
ELIAS: End-to-End Learning to Index and Search in Large Output Spaces	2023	Gupta, Nilesh

Alle Artikel auflisten