Continuous Inverse Optimal Control with Locally Optimal Examples

Jun, 2012

Sergey Levine, Vladlen Koltun

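TL;DR: This paper presents a probabilistic inverse optimal control algorithm suited to large continuous tasks. By using a local approximation of the reward function, the method can learn from demonstrations that are not globally optimal, which removes the assumption of global optimality.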
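To make the idea of a local reward approximation concrete, the following is a minimal sketch, not the authors' implementation. It assumes a model in which a demonstrated action sequence u has probability proportional to exp(r(u)), and it replaces the reward with its second-order Taylor expansion around the demonstration, so the normalizing integral becomes Gaussian. The function name `laplace_log_likelihood` and its arguments `g` (gradient of the total reward with respect to the actions) and `H` (the corresponding Hessian) are hypothetical names chosen for illustration.

```python
import numpy as np


def laplace_log_likelihood(g, H):
    """Approximate log-likelihood of a demonstration under P(u) ∝ exp(r(u)).

    Assumption: r is replaced by its second-order expansion around the
    demonstrated actions, with gradient g (shape (d,)) and Hessian H
    (shape (d, d)). H must be negative definite, i.e. the demonstration
    lies in a locally concave region of the reward.
    """
    g = np.asarray(g, dtype=float)
    H = np.asarray(H, dtype=float)
    d = g.shape[0]

    # Cholesky of -H raises numpy.linalg.LinAlgError if -H is not
    # positive definite, which doubles as the negative-definiteness check.
    chol = np.linalg.cholesky(-H)
    log_det_neg_H = 2.0 * np.sum(np.log(np.diag(chol)))

    # 1/2 g^T H^{-1} g: zero when the demonstration is a local optimum
    # (g = 0) and negative otherwise.
    quad = 0.5 * g @ np.linalg.solve(H, g)

    return quad + 0.5 * log_det_neg_H - 0.5 * d * np.log(2.0 * np.pi)
```

Under these assumptions the likelihood depends only on derivatives of the reward at the demonstration itself, so no global policy or value function has to be computed. For an exactly quadratic reward the approximation is exact and coincides with a Gaussian log-density, which gives a simple sanity check.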

Abstract

Inverse optimal control, also known as inverse reinforcement learning, is the problem of recovering an unknown reward function in a Markov decisi