Manuel Eberhardinger, Johannes Maucher, Setareh Maghsudi
TL;DR使用程序合成方法对深度强化学习代理进行模仿,以了解其学习的概念和决策过程。
Abstract
Understanding the interactions of agents trained with deep reinforcement
learning is crucial for deploying agents in games or the real world. In the
former, unreasonable actions confuse players. In the latter, that effect is
even more significant, as unexpected behavior cause accidents with potentially
grave and long-lasting consequences for the involved ind