Optimal Policies Tend To Seek Power

Bibliographic Details
Published in: NeurIPS (35th : 2021 : Online) 35th Conference on Neural Information Processing Systems (NeurIPS 2021) ; Volume 28 of 36
Main Author: Turner, Alex (Author)
Other Authors: Smith, Logan (Author), Shah, Rohin (Author), Critch, Andrew (Author), Tadepalli, Prasad (Author)
Pages: 35
Format: UnknownFormat
Language: English
Published: 2022
Title Year Author
NN-Baker: A Neural-network Infused Algorithmic Framework for Optimization Problems on Geometric Intersection Graphs 2022 McCarty, Evan
RMIX: Learning Risk-Sensitive Policies for Cooperative Reinforcement Learning Agents 2022 Qiu, Wei
XDO: A Double Oracle Algorithm for Extensive-Form Games 2022 McAleer, Stephen
Minibatch and Momentum Model-based Methods for Stochastic Weakly Convex Optimization 2022 Deng, Qi
Bandits with Knapsacks beyond the Worst Case 2022 Sankararaman, Karthik Abinav
Learning to Compose Visual Relations 2022 Liu, Nan
Fair Sparse Regression with Clustering: An Invex Relaxation for a Combinatorial Problem 2022 Barik, Adarsh
Unbalanced Optimal Transport through Non-negative Penalized Linear Regression 2022 Chapel, Laetitia
Adaptive Diffusion in Graph Neural Networks 2022 Zhao, Jialin
PartialFed: Cross-Domain Personalized Federated Learning via Partial Initialization 2022 Sun, Benyuan
Interpolation can hurt robust generalization even when there is no noise 2022 Donhauser, Konstantin
Towards a Theoretical Framework of Out-of-Distribution Generalization 2022 Ye, Haotian
Fast rates for prediction with limited expert advice 2022 Saad, El Mehdi
MERLOT: Multimodal Neural Script Knowledge Models 2022 Zellers, Rowan
Off-Policy Risk Assessment in Contextual Bandits 2022 Huang, Audrey
Estimating Multi-cause Treatment Effects via Single-cause Perturbation 2022 Qian, Zhaozhi
Multiclass versus Binary Differentially Private PAC Learning 2022 Sivakumar, Satchit
Policy Optimization in Adversarial MDPs: Improved Exploration via Dilated Bonuses 2022 Luo, Haipeng
Cycle Self-Training for Domain Adaptation 2022 Liu, Hong
A Note on Sparse Generalized Eigenvalue Problem 2022 Cai, Yunfeng