Jun, 2019
Wasserstein 对抗性模仿学习
Wasserstein Adversarial Imitation Learning
Huang Xiao, Michael Herman, Joerg Wagner, Sebastian Ziesche, Jalal Etesami...
TL;DR本文研究 Imitation Learning,结合 Optimal Transport 提出 Wasserstein Adversarial Imitation Learning 来更高效地解决 inverse reinforcement learning 中 reward function 问题。在机器人实验中,该方法只需一个 expert demonstration 即可实现显著提升。