BriefGPT.xyz
Nov, 2019
模仿学习方法的差异最小化视角
A Divergence Minimization Perspective on Imitation Learning Methods
HTML
PDF
Seyed Kamyar Seyed Ghasemipour, Richard Zemel, Shixiang Gu
TL;DR
本文提出了一种基于分歧最小化的Imitation Learning方法,即$f$-MAX,将IRL方法如GAIL和AIRL联系起来并揭示了它们的算法特性,通过期望最大化演算法来教授机器人在推手环境中进行多样化的行为。
Abstract
In many settings, it is desirable to learn decision-making and control policies through learning or bootstrapping from expert demonstrations. The most common approaches under this
imitation learning
(IL) framework are
b
→