BriefGPT.xyz
Mar, 2019
基于模型的Atari强化学习
Model-Based Reinforcement Learning for Atari
HTML
PDF
Lukasz Kaiser, Mohammad Babaeizadeh, Piotr Milos, Blazej Osinski, Roy H Campbell...
TL;DR
这篇文章介绍了基于视频预测模型的 Simulated Policy Learning 方法,该方法通过在仅与环境交互 100k 次(两小时实时游戏)的情况下,在多个 Atari 游戏中实现比现有的基于模型无关的方法更好的表现。
Abstract
model-free reinforcement learning
(RL) can be used to learn effective policies for complex tasks, such as
atari games
, even from image observations. However, this typically requires very large amounts of interact
→