POLICY-GRADIENT ALGORITHMS HAVE NO GUARANTEES OF CONVERGENCE IN LINEAR QUADRATIC GAMES

Bibliographic Details
Published in: AAMAS (19th : 2020 : Online) International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2020); Volume 2 of 3
Main author: Mazumdar, Eric (Author)
Other authors: Ratliff, Lillian J. (Author), Jordan, Michael I. (Author), Sastry, S. Shankar (Author)
Pages: 2020
Format: UnknownFormat
Language: eng
Published: 2021
Title Year Author
A STORY OF TWO STREAMS: REINFORCEMENT LEARNING MODELS FROM HUMAN BEHAVIOR AND NEUROPSYCHIATRY 2021 Lin, Baihan
OFF-POLICY DEEP REINFORCEMENT LEARNING WITH ANALOGOUS DISENTANGLED EXPLORATION 2021 Liu, Anji
COMPETITIVE RATIOS FOR ONLINE MULTI-CAPACITY RIDESHARING 2021 Lowalekar, Meghna
A BUDGET-LIMITED MECHANISM FOR CATEGORY-AWARE CROWDSOURCING SYSTEMS 2021 Luo, Yuan
LIKELIHOOD QUANTILE NETWORKS FOR COORDINATING MULTI-AGENT REINFORCEMENT LEARNING 2021 Lyu, Xueguang
OPTIMAL TEMPORAL PLAN MERGING 2021 Santos, Gilberto Marcon Dos
POLICY-GRADIENT ALGORITHMS HAVE NO GUARANTEES OF CONVERGENCE IN LINEAR QUADRATIC GAMES 2021 Mazumdar, Eric
THE COMPLEXITY OF CLONING CANDIDATES IN MULTIWINNER ELECTIONS 2021 Neveling, Marc
MULTIWINNER CANDIDACY GAMES 2021 Obraztsova, Svetlana
DRIVING EXPLORATION BY MAXIMUM DISTRIBUTION IN GAUSSIAN PROCESS 2021 Nuara, Alessandro
GOAL RECOGNITION USING OFF-THE-SHELF PROCESS MINING TECHNIQUES 2021 Polyvyanyy, Artem
TOLL-BASED LEARNING FOR MINIMISING CONGESTION UNDER HETEROGENEOUS PREFERENCES 2021 Ramos, Gabriel De O.
A STRUCTURAL SOLUTION TO SEQUENTIAL MORAL DILEMMAS 2021 Rodriguez-Soto, Manel
MULTI-LEVEL FITNESS CRITICS FOR COOPERATIVE COEVOLUTION 2021 Rockefeller, Golden
VIRAL VS. EFFECTIVE: UTILITY BASED INFLUENCE MAXIMIZATION 2021 Sabato, Yael
BAYESIAN ACTIVE MALWARE ANALYSIS 2021 Sartea, Riccardo
CAN AGENTS LEARN BY ANALOGY? AN INFERABLE MODEL FOR PAC REINFORCEMENT LEARNING 2021 Sun, Yanchao
DIFFERENTIALLY PRIVATE CONTEXTUAL DYNAMIC PRICING 2021 Tang, Wei
PLANNABLE APPROXIMATIONS TO MDP HOMOMORPHISMS: EQUIVARIANCE UNDER ACTIONS 2021 Pol, Elise Van Der
AGENT ONTOLOGY ALIGNMENT REPAIR THROUGH DYNAMIC EPISTEMIC LOGIC 2021 Berg, Line Van Den