REINFORCEMENT LEARNING FOR COST-AWARE MARKOV DECISION PROCESSES

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:International Conference on Machine Learning (38. : 2021 : Online) International Conference on Machine Learning (ICML 2021 ; Part 13 of 16
1. Verfasser: SUTTLE, WESLEY A. (VerfasserIn)
Weitere Verfasser: ZHANG, KAIQING (VerfasserIn), YANG, ZHUORAN (VerfasserIn), KRAEMER, DAVID N. (VerfasserIn), LIU, JI (VerfasserIn)
Pages:2021
Format: UnknownFormat
Sprache:eng
Veröffentlicht: 2022
Schlagworte:
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Titel Jahr Verfasser
AUTOSAMPLING: SEARCH FOR EFFECTIVE DATA SAMPLING SCHEDULES 2022 SUN, MING
NONDETERMINISM AND INSTABILITY IN NEURAL NETWORK OPTIMIZATION 2022 SUMMERS, CECILIA
REASONING OVER VIRTUAL KNOWLEDGE BASES WITH OPEN PREDICATE RELATIONS 2022 SUN, HAITIAN
MONTE CARLO VARIATIONAL AUTO-ENCODERS 2022 THIN, ACHILLE
LTL2ACTION: GENERALIZING LTL INSTRUCTIONS FOR MULTI-TASK RL 2022 VAEZIPOOR, PASHOOTAN
SCALABLE VARIATIONAL GAUSSIAN PROCESSES VIA HARMONIC KERNEL DECOMPOSITION 2022 SUN, SHENGYANG
ROBUST REPRESENTATION LEARNING VIA PERCEPTUAL SIMILARITY METRICS 2022 TAGHANAKI, SAEID A.
TAYLOR EXPANSION OF DISCOUNT FACTORS 2022 TANG, YUNHAO
T-SCI: A TWO-STAGE CONFORMAL INFERENCE ALGORITHM WITH GUARANTEED COVERAGE FOR COX-MLP 2022 TENG, JIAYE
UNDERSTANDING INVARIANCE VIA FEEDFORWARD INVERSION OF DISCRIMINATIVELY TRAINED CLASSIFIERS 2022 TETERWAK, PIOTR
RESOURCE ALLOCATION IN MULTI-ARMED BANDIT EXPLORATION: OVERCOMING SUBLINEAR SCALING WITH ADAPTIVE PARALLELISM 2022 THANANJEYAN, BRIJEN
TOWARDS DOMAIN-AGNOSTIC CONTRASTIVE LEARNING 2022 VERMA, VIKAS
UNBIASED GRADIENT ESTIMATION IN UNROLLED COMPUTATION GRAPHS WITH PERSISTENT EVOLUTION STRATEGIES 2022 VICOL, PAUL
ACCELERATING FEEDFORWARD COMPUTATION VIA PARALLEL NONLINEAR EQUATION SOLVING 2022 SONG, YANG
CAUSAL CURIOSITY: RL AGENTS DISCOVERING SELF-SUPERVISED EXPERIMENTS FOR CAUSAL REPRESENTATION LEARNING 2022 SONTAKKE, SUMEDH A.
DECOMPOSED MUTUAL INFORMATION ESTIMATION FOR CONTRASTIVE REPRESENTATION LEARNING 2022 SORDONI, ALESSANDRO
DFAC FRAMEWORK: FACTORIZING THE VALUE FUNCTION VIA QUANTILE MIXTURE FOR MULTI-AGENT DISTRIBUTIONAL Q-LEARNING 2022 SUN, WEI-FANG
CONSERVATIVE OBJECTIVE MODELS FOR EFFECTIVE OFFLINE MODEL-BASED OPTIMIZATION 2022 TRABUCCO, BRANDON
FAST PROJECTION ONTO CONVEX SMOOTH CONSTRAINTS 2022 USMANOVA, ILNURA
MULTI-TASK REINFORCEMENT LEARNING WITH CONTEXT-BASED REPRESENTATIONS 2022 SODHANI, SHAGUN
Alle Artikel auflisten