BriefGPT.xyz
Nov, 2022
Leveraging Sequentiality in Reinforcement Learning from a Single Demonstration
Alexandre Chenu, Olivier Serris, Olivier Sigaud, Nicolas Perrin-Gilbert
TL;DR
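This study uses deep reinforcement learning to learn, from a single demonstration, a goal-conditioned policy for controlling complex robotic tasks. It proposes the DCIL-II algorithm to address the compatibility between successive goals, and demonstrates unprecedented sample efficiency in simulated environments.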
Abstract
Deep reinforcement learning has been successfully applied to learn robotic control. However, the corresponding algorithms struggle when applied to problems where the agent is only rewarded after achieving a compl…