BriefGPT.xyz
Mar, 2017
第三人称模仿学习
Third-Person Imitation Learning
HTML
PDF
Bradly C. Stadie, Pieter Abbeel, Ilya Sutskever
TL;DR
本文提出了一种利用领域混淆技术进行无监督第三人称模仿学习的方法,证明了该方法在点质点领域、伸手领域和倒立摆等领域的第三人称模仿学习中取得成功。
Abstract
reinforcement learning
(RL) makes it possible to train agents capable of achiev- ing sophisticated goals in complex and uncertain environments. A key difficulty in
reinforcement learning
is specifying a reward fu
→