BriefGPT.xyz
Jul, 2021
使用变分模型的视觉对抗性模仿学习
Visual Adversarial Imitation Learning using Variational Models
HTML
PDF
Rafael Rafailov, Tianhe Yu, Aravind Rajeswaran, Chelsea Finn
TL;DR
该论文介绍了一种使用固定数据集的视觉演示来学习如何完成任务的方法,并提出了一种基于变分模型的对抗性模仿学习算法来处理高维空间、固定奖励等挑战,实验结果表明 V-MAIL 算法能够高效稳定地学习成功的视觉动作策略。
Abstract
Reward function specification, which requires considerable human effort and iteration, remains a major impediment for learning behaviors through
deep reinforcement learning
. In contrast, providing
visual demonstrations<
→