基于生成对抗网络的样本高效模仿学习

Sep, 2018

基于生成对抗网络的样本高效模仿学习

Sample-Efficient Imitation Learning via Generative Adversarial Nets

Lionel Blondé, Alexandros Kalousis

TL;DR本文介绍了一种在模型free的前提下能够提高采样效率的演员评论家结构，利用了GAIL中对抗训练的方法和离策略演员评论家的优势，在多个连续控制任务中，我们证明了这种方法的简洁易行和稳定性。

Abstract

Recent work in imitation learning articulate their formulation around the GAIL architecture, relying on the adversarial training procedure introduced in →