BriefGPT.xyz
May, 2019
面向任务不可知的深度强化学习动态先验
Task-Agnostic Dynamics Priors for Deep Reinforcement Learning
HTML
PDF
Yilun Du, Karthik Narasimhan
TL;DR
通过学习通用的动力学先验以及使用物理视频进行预训练和微调,我们提出了一种加速政策学习和改进表现的基于模型的深度强化学习方法,同时也实现了更有效的环境迁移。
Abstract
While
model-based deep reinforcement learning
(RL) holds great promise for sample efficiency and generalization, learning an accurate
dynamics model
is often challenging and requires substantial interaction with
→