BriefGPT.xyz
Jul, 2018
基于观察数据的生成对抗模仿
Generative Adversarial Imitation from Observation
HTML
PDF
Faraz Torabi, Garrett Warnell, Peter Stone
TL;DR
本文提出了一种基于生成对抗网络的从观察中模仿学习方法(GAIfO),它可以在没有行动信息的情况下直接从状态演示中学习,进行了两种不同设置的实验证明它在高维模拟环境中优于现有的直接从状态演示方法。
Abstract
Imitation from
observation
(IfO) is the problem of learning directly from
state-only demonstrations
without having access to the demonstrator's actions. The lack of action information both distinguishes IfO from
→