BriefGPT.xyz
July 2018
Backplay: One Must Always Invert
Backplay: "Man muss immer umkehren"
Cinjon Resnick, Roberta Raileanu, Sanyam Kapoor, Alex Peysakhovich, Kyunghyun Cho...
TL;DR
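Improving sample efficiency is a challenge in model-free reinforcement learning. This paper proposes Backplay, a method that uses a single demonstration to construct a curriculum for a task and begins training from the end of that demonstration, yielding faster training than competing methods.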
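The curriculum idea can be sketched as a rule for choosing where along the demonstration each training episode begins. This is a minimal illustration, not the authors' implementation: the function name `backplay_start_index`, the `progress` and `window` parameters, and the linear schedule that slides the start point from the end of the demonstration back toward its first state are all assumptions made for this example.

```python
import random


def backplay_start_index(demo_len, progress, window=4, rng=random):
    """Pick the demonstration index at which a training episode starts.

    All names and the linear schedule are hypothetical, for illustration.
    demo_len: number of states recorded in the single demonstration.
    progress: fraction of training completed, clamped to [0, 1].
    window:   how many consecutive indices are eligible at each stage.
    """
    if demo_len < 1 or window < 1:
        raise ValueError("demo_len and window must be at least 1")
    progress = min(max(progress, 0.0), 1.0)
    last = demo_len - 1
    # The latest eligible start slides from the final demonstration
    # state (progress 0) back to the initial state (progress 1).
    hi = round(last * (1.0 - progress))
    lo = max(0, hi - window + 1)
    return rng.randint(lo, hi)  # inclusive on both ends


if __name__ == "__main__":
    for p in (0.0, 0.5, 1.0):
        print(p, backplay_start_index(demo_len=20, progress=p))
```

Early in training the agent starts a few steps from the end of the demonstration, where a sparse reward is close; by the end of training it starts from the environment's ordinary initial state.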
Abstract
A long-standing problem in model-free reinforcement learning (RL) is that it requires a large number of trials to learn a good policy, especially in environments with sparse rewards. We explore a method to increase the ...