Lazy-MDPs: Towards Interpretable RL by Learning When to Act
|
2022 |
Jacq, Alexis |
ASM-PPO: Asynchronous and Scalable Multi-Agent PPO for Cooperative Charging
|
2022 |
Liang, Yongheng |
Equilibrium Computation For Knockout Tournaments Played By Groups
|
2022 |
Lisowski, Grzegorz |
Deploying Vaccine Distribution Sites for Improved Accessibility and Equity to Support Pandemic Response
|
2022 |
Li, George Z. |
Warmth and Competence in Human-Agent Cooperation
|
2022 |
McKee, Kevin R. |
Learning to Transfer Role Assignment Across Team Sizes
|
2022 |
Nguyen, Dung |
Preference-Based Goal Refinement in BDI Agents
|
2022 |
Mohajeriparizi, Mostafa |
Factorial Agent Markov Model: Modeling Other Agents’ Behavior in presence of Dynamic Latent Decision Factors
|
2022 |
Orlov-Savko, Liubove |
Networked Restless Multi-Armed Bandits for Mobile Interventions
|
2022 |
Ou, Han-Ching |
Revenue and User Traffic Maximization in Mobile Short-Video Advertising
|
2022 |
Ran, Dezhi |
Decoupled Reinforcement Learning to Stabilise Intrinsicaliy-Motivated Exploration
|
2022 |
Schäfer, Lukas |
Off-Policy Evolutionary Reinforcement Learning with Maximum Mutations
|
2022 |
Suri, Karush |
Context-Aware Modelling for Multi-Robot Systems Under Uncertainty
|
2022 |
Street, Charlie |
Socially Supervised Representation Learning: The Role of Subjectivity in Learning Efficient Representations
|
2022 |
Taylor, Julius |
Optimal Matchings with One-Sided Preferences: Fixed and Cost-Based Quotas
|
2022 |
Santhini, K. A. |
How Hard is Safe Bribery?
|
2022 |
Karia, Neel |
BADDr: Bayes-Adaptive Deep Dropout RL for POMDPs
|
2022 |
Katt, Sammie |
Tactile Pose Estimation and Policy Learning for Unknown Object Manipulation
|
2022 |
Kelestemur, Tarik |
Equilibria in Schelling Games: Computational Hardness and Robustness
|
2022 |
Kreisel, Luca |
Multimodal Analysis of the Predictability of Hand-gesture Properties
|
2022 |
Kucherenko, Taras |