BriefGPT.xyz
Sep, 2018
基于生成对抗网络的样本高效模仿学习
Sample-Efficient Imitation Learning via Generative Adversarial Nets
HTML
PDF
Lionel Blondé, Alexandros Kalousis
TL;DR
本文介绍了一种在模型free的前提下能够提高采样效率的演员评论家结构,利用了GAIL中对抗训练的方法和离策略演员评论家的优势,在多个连续控制任务中,我们证明了这种方法的简洁易行和稳定性。
Abstract
Recent work in
imitation learning
articulate their formulation around the GAIL architecture, relying on the
adversarial training
procedure introduced in
→