BriefGPT.xyz
Oct, 2022
高效示教学习的规划
Planning for Sample Efficient Imitation Learning
HTML
PDF
Zhao-Heng Yin, Weirui Ye, Qifeng Chen, Yang Gao
TL;DR
提出了EfficientImitate这一基于规划的模仿学习方法,成功地将两类看似不兼容的模仿算法:行为克隆和对抗模仿学习,自然地统一到了一个框架中,实现了在性能和样本效率方面的高水平。
Abstract
imitation learning
is a class of promising
policy learning
algorithms that is free from many practical issues with reinforcement learning, such as the reward design issue and the exploration hardness. However, th
→