Factored Policy Gradients: Leveraging Structure for Efficient Learning in MOMDPs

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	NeurIPS (35. : 2021 : Online) 35th Conference on Neural Information Processing Systems (NeurIPS 2021) ; Volume 7 of 36
1. Verfasser:	Spooner, Thomas (VerfasserIn)
Weitere Verfasser:	Vadori, Nelson (VerfasserIn), Ganesh, Sumitra (VerfasserIn)
Pages:	35
Format:	UnknownFormat
Sprache:	eng
Veröffentlicht:	2022
Schlagworte:
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Bestellen

Titel	Jahr	Verfasser
Offline RL Without Off-Policy Evaluation	2022	Brandfonbrener, David
Continuous vs. Discrete Optimization of Deep Neural Networks	2022	Elkabetz, Omer
Can contrastive learning avoid shortcut solutions?	2022	Robinson, Joshua
Matrix encoding networks for neural combinatorial optimization	2022	Kwon, Yeong-Dae
Capturing implicit hierarchical structure in 3D biomedical images with self-supervised hyperbolic representations	2022	Hsu, Joy
Continuous Latent Process Flows	2022	Deng, Ruizhi
Domain Invariant Representation Learning with Domain Density Transformations	2022	Nguyen, A. Tuan
Blending Anti-Aliasing into Vision Transformer	2022	Qian, Shengju
Generalization of Model-Agnostic Meta-Learning Algorithms: Recurring and Unseen Tasks	2022	Fallah, Alireza
Redesigning the Transformer Architecture with Insights from Multi-particle Dynamical Systems	2022	Dutta, Subhabrata
Similarity and Matching of Neural Network Representations	2022	Csiszárik, Adrián
On the Variance of the Fisher Information for Deep Learning	2022	Soen, Alexander
Brick-by-Brick: Combinatorial Construction with Deep Reinforcement Learning	2022	Chung, Hyunsoo
Provable Guarantees for Self-Supervised Deep Learning with Spectral Contrastive Loss	2022	HaoChen, Jeff Z.
When False Positive is Intolerant: End-to-End Optimization with Low FPR for Multipartite Ranking	2022	Wen, Peisong
Convex Polytope Trees	2022	Armandpour, Mohammadreza
Stability and Deviation Optimal Risk Bounds with Convergence Rate O(1/n)	2022	Klochkov, Yegor
CLDA: Contrastive Learning for Semi-Supervised Domain Adaptation	2022	Singh, Ankit
Differentially Private n-gram Extraction	2022	Kim, Kunho
SketchGen: Generating Constrained CAD Sketches	2022	Para, Wamiq

Alle Artikel auflisten