Generative Adversarial Imitation from Observation

July 2018

Generative Adversarial Imitation from Observation

Faraz Torabi, Garrett Warnell, Peter Stone

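TL;DR: This paper proposes generative adversarial imitation from observation (GAIfO), a method based on generative adversarial networks that learns directly from state-only demonstrations, without any action information. Experiments in two different settings show that it outperforms existing methods that learn directly from state-only demonstrations in high-dimensional simulated environments.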
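To make the state-only idea concrete, here is a minimal sketch of a discriminator that scores state transitions (s, s') rather than state-action pairs. It is not the authors' implementation. The logistic model, the label convention (1 = demonstrator), the log-score surrogate reward, the toy 1-D trajectories, and every function name below are assumptions chosen for illustration, and the policy-update step that would consume the reward is omitted.

```python
import numpy as np

def make_transitions(states):
    """Pair consecutive states into (s, s') rows; no actions are involved."""
    return np.concatenate([states[:-1], states[1:]], axis=1)

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def train_discriminator(expert, imitator, steps=500, lr=0.1):
    """Fit a logistic discriminator D(s, s'); label 1 = demonstrator (assumed convention)."""
    x = np.vstack([expert, imitator])
    y = np.concatenate([np.ones(len(expert)), np.zeros(len(imitator))])
    w = np.zeros(x.shape[1])
    b = 0.0
    for _ in range(steps):
        p = sigmoid(x @ w + b)
        grad = p - y  # gradient of binary cross-entropy w.r.t. the logits
        w -= lr * (x.T @ grad) / len(y)
        b -= lr * grad.mean()
    return w, b

def imitation_reward(transitions, w, b, eps=1e-8):
    """Surrogate reward: larger when a transition resembles the demonstrator's."""
    return np.log(sigmoid(transitions @ w + b) + eps)

rng = np.random.default_rng(0)
# Hypothetical toy data: the demonstrator drifts right, the imitator drifts left.
expert_states = np.cumsum(rng.normal(0.05, 0.01, size=(50, 1)), axis=0)
imitator_states = np.cumsum(rng.normal(-0.05, 0.01, size=(50, 1)), axis=0)

expert_t = make_transitions(expert_states)
imitator_t = make_transitions(imitator_states)
w, b = train_discriminator(expert_t, imitator_t)

print("mean reward, demonstrator transitions:", imitation_reward(expert_t, w, b).mean())
print("mean reward, imitator transitions:", imitation_reward(imitator_t, w, b).mean())
```

In a full adversarial loop, the imitator's policy would be updated with a reinforcement learning algorithm to increase this reward, fresh imitator transitions would be collected, and the discriminator would be retrained. The sketch covers only a single discriminator fit.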

Abstract

Imitation from observation (IfO) is the problem of learning directly from state-only demonstrations without having access to the demonstrator's actions. The lack of action information both distinguishes IfO from