EXPLOITING STRUCTURED DATA FOR LEARNING CONTAGIOUS DISEASES UNDER INCOMPLETE TESTING
|
2022 |
MAKAR, MAGGIE |
EFFICIENT DEVIATION TYPES AND LEARNING FOR HINDSIGHT RATIONALITY IN EXTENSIVE-FORM GAMES
|
2022 |
MORRILL, DUSTIN |
EMERGENT SOCIAL LEARNING VIA MULTI-AGENT REINFORCEMENT LEARNING
|
2022 |
NDOUSSE, KAMAL |
BAYESIAN ALGORITHM EXECUTION: ESTIMATING COMPUTABLE PROPERTIES OF BLACK-BOX FUNCTIONS USING MUTUAL INFORMATION
|
2022 |
NEISWANGER, WILLIE |
NEAR-OPTIMAL ALGORITHMS FOR EXPLAINABLE K-MEDIANS AND K-MEANS
|
2022 |
MAKARYCHEV, KONSTANTIN |
NEAR-OPTIMAL MODEL-FREE REINFORCEMENT LEARNING IN NON-STATIONARY EPISODIC MDPS
|
2022 |
MAO, WEICHAO |
INVERSE CONSTRAINED REINFORCEMENT LEARNING
|
2022 |
MALIK, SHEHRYAR |
LEARN2HOP: LEARNED OPTIMIZATION ON ROUGH LANDSCAPES
|
2022 |
MERCHANT, AMIL |
GMAC: A DISTRIBUTIONAL PERSPECTIVE ON ACTOR-CRITIC FRAMEWORK
|
2022 |
NAM, DANIEL W. |
INCENTIVIZING COMPLIANCE WITH ALGORITHMIC INSTRUMENTS
|
2022 |
NGO, DUNG DANIEL T. |
GEOMETRIC CONVERGENCE OF ELLIPTICAL SLICE SAMPLING
|
2022 |
NATAROVSKII, VIACHESLAV |
STABILITY AND CONVERGENCE OF STOCHASTIC GRADIENT CLIPPING: BEYOND LIPSCHITZ CONTINUITY AND SMOOTHNESS
|
2022 |
MAI, VIEN V. |
UCB MOMENTUM Q-LEARNING: CORRECTING THE BIAS WITHOUT FORGETTING
|
2022 |
MÉNARD, PIERRE |
PROVABLY EFFICIENT LEARNING OF TRANSFERABLE REWARDS
|
2022 |
METELLI, ALBERTO MARIA |
SIGNATURED DEEP FICTITIOUS PLAY FOR MEAN FIELD GAMES WITH COMMON NOISE
|
2022 |
MIN, MING |
OFFLINE META-REINFORCEMENT LEARNING WITH ADVANTAGE WEIGHTING
|
2022 |
MITCHELL, ERIC |
PODS: POLICY OPTIMIZATION VIA DIFFERENTIABLE SIMULATION
|
2022 |
ZAMORA, MIGUEL |
VALUE-AT-RISK OPTIMIZATION WITH GAUSSIAN PROCESSES
|
2022 |
NGUYEN, QUOC PHONG |
DOMAIN GENERALIZATION USING CAUSAL MATCHING
|
2022 |
MAHAJAN, DIVYAT |
ADAPTIVE SAMPLING FOR BEST POLICY IDENTIFICATION IN MARKOV DECISION PROCESSES
|
2022 |
MARJANI, AYMEN AL |