A STORY OF TWO STREAMS: REINFORCEMENT LEARNING MODELS FROM HUMAN BEHAVIOR AND NEUROPSYCHIATRY | 2021 | Lin, Baihan
OFF-POLICY DEEP REINFORCEMENT LEARNING WITH ANALOGOUS DISENTANGLED EXPLORATION | 2021 | Liu, Anji
COMPETITIVE RATIOS FOR ONLINE MULTI-CAPACITY RIDESHARING | 2021 | Lowalekar, Meghna
A BUDGET-LIMITED MECHANISM FOR CATEGORY-AWARE CROWDSOURCING SYSTEMS | 2021 | Luo, Yuan
LIKELIHOOD QUANTILE NETWORKS FOR COORDINATING MULTI-AGENT REINFORCEMENT LEARNING | 2021 | Lyu, Xueguang
OPTIMAL TEMPORAL PLAN MERGING | 2021 | Santos, Gilberto Marcon Dos
POLICY-GRADIENT ALGORITHMS HAVE NO GUARANTEES OF CONVERGENCE IN LINEAR QUADRATIC GAMES | 2021 | Mazumdar, Eric
THE COMPLEXITY OF CLONING CANDIDATES IN MULTIWINNER ELECTIONS | 2021 | Neveling, Marc
MULTIWINNER CANDIDACY GAMES | 2021 | Obraztsova, Svetlana
DRIVING EXPLORATION BY MAXIMUM DISTRIBUTION IN GAUSSIAN PROCESS | 2021 | Nuara, Alessandro
GOAL RECOGNITION USING OFF-THE-SHELF PROCESS MINING TECHNIQUES | 2021 | Polyvyanyy, Artem
TOLL-BASED LEARNING FOR MINIMISING CONGESTION UNDER HETEROGENEOUS PREFERENCES | 2021 | Ramos, Gabriel De O.
A STRUCTURAL SOLUTION TO SEQUENTIAL MORAL DILEMMAS | 2021 | Rodriguez-Soto, Manel
MULTI-LEVEL FITNESS CRITICS FOR COOPERATIVE COEVOLUTION | 2021 | Rockefeller, Golden
VIRAL VS. EFFECTIVE: UTILITY BASED INFLUENCE MAXIMIZATION | 2021 | Sabato, Yael
BAYESIAN ACTIVE MALWARE ANALYSIS | 2021 | Sartea, Riccardo
CAN AGENTS LEARN BY ANALOGY? AN INFERABLE MODEL FOR PAC REINFORCEMENT LEARNING | 2021 | Sun, Yanchao
DIFFERENTIALLY PRIVATE CONTEXTUAL DYNAMIC PRICING | 2021 | Tang, Wei
PLANNABLE APPROXIMATIONS TO MDP HOMOMORPHISMS: EQUIVARIANCE UNDER ACTIONS | 2021 | Pol, Elise Van Der
AGENT ONTOLOGY ALIGNMENT REPAIR THROUGH DYNAMIC EPISTEMIC LOGIC | 2021 | Berg, Line Van Den