Policy Prediction Network: Model-Free Behavior Policy with Model-Based Learning in Continuous Action Space

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:ECML PKDD (4. : 2019 : Würzburg) Machine learning and knowledge discovery in databases ; Part 3
1. Verfasser: Wellmer, Zac (VerfasserIn)
Weitere Verfasser: Kwok, James T. (VerfasserIn)
Pages:3
Format: UnknownFormat
Sprache:eng
Veröffentlicht: 2020
Schlagworte:
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Titel Jahr Verfasser
Learning 3D Navigation Protocols on Touch Interfaces with Cooperative Multi-agent Reinforcement Learning 2020 Debard, Quentin
Safe Policy Improvement with Soft Baseline Bootstrapping 2020 Nadjahi, Kimia
Stochastic Activation Actor Critic Methods 2020 Shang, Wenling
A Ranking Model Motivated by Nonnegative Matrix Factorization with Applications to Tennis Tournaments 2020 Xia, Rui
A Reduction of Label Ranking to Multiclass Classification 2020 Brinker, Klaus
Pairwise Learning to Rank by Neural Networks Revisited: Reconstruction, Theoretical Analysis and Practical Performance 2020 Köppel, Marius
Automatic Recognition of Student Engagement Using Deep Learning and Facial Expression 2020 Nezami, Omid Mohamad
Augmenting Semantic Representation of Depressive Language: From Forums to Microblogs 2020 Farruque, Nawshad
Wearable-Based Parkinson's Disease Severity Monitoring Using Deep Learning 2020 Goschenhofer, Jann
A Deep Multi-task Approach for Residual Value Forecasting 2020 Rashed, Ahmed
Manufacturing Dispatching Using Reinforcement and Transfer Learning 2020 Zheng, Shuai
MatrixCalculus.org – Computing Derivatives of Matrix and Tensor Expressions 2020 Laue, Sören
ISETS: Incremental Shapelet Extraction from Time Series Stream 2020 Zuo, Jingwei
Practical Open-Loop Optimistic Planning 2020 Leurent, Edouard
An Engineered Empirical Bernstein Bound 2020 Burgess, Mark A.
Attentive Multi-task Deep Reinforcement Learning 2020 Bräm, Timo
Sequential Learning over Implicit Feedback for Robust Large-Scale Recommender System 2020 Burashnikova, Aleksandra
Transfer Learning in Credit Risk 2020 Suryanto, Hendra
LSTM Encoder-Predictor for Short-Term Train Load Forecasting 2020 Pasini, Kevin
BK-ADAPT: Dynamic Background Knowledge for Automating Data Transformation 2020 Contreras-Ochando, Lidia
Alle Artikel auflisten