old_topic:offline_rl
문서의 이전 판입니다!
Offline RL
- 2021-07 Offline Model-Based Optimization via Normalized Maximum Likelihood Estimation
- 2021-07 Offline Meta-Reinforcement Learning with Online Self-Supervision
- 2021-07 Conservative Objective Models for Effective Offline Model-Based Optimization
- 2021-06 Reinforcement Learning as One Big Sequence Modeling Problem
- 2021-06 Decision Transformer: Reinforcement Learning via Sequence Modeling
- 2021-04 Actionable Models: Unsupervised Offline Reinforcement Learning of Robotic Skills
- 2021-03 Amortized Conditional Normalized Maximum Likelihood: Reliable Out of Distribution Uncertainty Estimation
- 2021-01 What Can I Do Here? Learning New Skills by Imagining Visual Affordances
- 2020-10 Implicit Under-Parameterization Inhibits Data-Efficient Deep Reinforcement Learning
- 2020-07 Hyperparameter Selection for Offline Reinforcement Learning
- 2020-07 Accelerating Online Reinforcement Learning with Offline Datasets
- 2020-06 Conservative Q-Learning for Offline Reinforcement Learning
old_topic/offline_rl.1705021808.txt.gz · 마지막으로 수정됨: 2024/03/23 02:38 (바깥 편집)