GRID-TO-GRAPH: FLEXIBLE SPATIAL RELATIONAL INDUCTIVE BIASES FOR REINFORCEMENT LEARNING

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:AAMAS (20. : 2021 : Online) 20th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2021) ; Volume 2 of 3
1. Verfasser: Jiang, Zhengyao (VerfasserIn)
Weitere Verfasser: Minervini, Pasquale (VerfasserIn), Jiang, Minqi (VerfasserIn), Rocktäschel, Tim (VerfasserIn)
Pages:20
Format: UnknownFormat
Sprache:eng
Veröffentlicht: 2021
Schlagworte:
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Titel Jahr Verfasser
LEARNING NODE-SELECTION STRATEGIES IN BOUNDED-SUBOPTIMAL CONFLICT-BASED SEARCH FOR MULTI-AGENT PATH FINDING 2021 Huang, Taoan
ACTION ADVISING WITH ADVICE IMITATION IN DEEP REINFORCEMENT LEARNING 2021 Ilhan, Ercüment
PARTITION AGGREGATION FOR PARTICIPATORY BUDGETING 2021 Jain, Pallavi
BEYOND "TO ACT OR NOT TO ACT": FAST LAGRANGIAN APPROACHES TO GENERAL MULTI-ACTION RESTLESS BANDITS 2021 Killian, Jackson A.
APPROVAL-BASED SHORTLISTING 2021 Lackner, Martin
PARALLEL CURRICULUM EXPERIENCE REPLAY IN DISTRIBUTED REINFORCEMENT LEARNING 2021 Li, Yuyu
STRUCTURED DIVERSIFICATION EMERGENCE VIA REINFORCED ORGANIZATION CONTROL AND HIERACHICAL CONSENSUS LEARNING 2021 Li, Wenhao
ENERGY-BASED IMITATION LEARNING 2021 Liu, Minghuan
LET THE DOCTOR DECIDE WHOM TO TEST: ADAPTIVE TESTING STRATEGIES TO TACKLE THE COVID-19 PANDEMIC 2021 Liang, Yu
CONTRASTING CENTRALIZED AND DECENTRALIZED CRITICS IN MULTI-AGENT REINFORCEMENT LEARNING 2021 Lyu, Xueguang
TO HOLD OR NOT TO HOLD? - REDUCING PASSENGER MISSED CONNECTIONS IN AIRLINES USING REINFORCEMENT LEARNING 2021 Malladi, Tejasvi
ADVERSARIAL LEARNING IN REVENUE-MAXIMIZING AUCTIONS 2021 Nedelec, Thomas
AN AGENT-BASED MODEL TO PREDICT PEDESTRIANS TRAJECTORIES WITH AN AUTONOMOUS VEHICLE IN SHARED SPACES 2021 Prédhumeau, Manon
LATENCY-AWARE LOCAL SEARCH FOR DISTRIBUTED CONSTRAINT OPTIMIZATION 2021 Rachmut, Ben
PEER-TO-PEER AUTONOMOUS AGENT COMMUNICATION NETWORK 2021 Rahmani, Lokman
TDPROP: DOES ADAPTIVE OPTIMIZATION WITH JACOBI PRECONDITIONING HELP TEMPORAL DIFFERENCE LEARNING? 2021 Romoff, Joshua
A LOCAL SEARCH BASED APPROACH TO SOLVE CONTINUOUS DCOPS 2021 Sarker, Amit
ACTIVE PERCEPTION WITHIN BDI AGENTS REASONING CYCLE 2021 Silva, Gustavo R.
ALWAYSSAFE: REINFORCEMENT LEARNING WITHOUT SAFETY CONSTRAINT VIOLATIONS DURING TRAINING 2021 Simão, Thiago D.
COOPERATIVE-COMPETITIVE REINFORCEMENT LEARNING WITH HISTORY-DEPENT REWARDS 2021 He, Keyang
Alle Artikel auflisten