强化学习中任务规范的统一

Sep, 2016

Unifying task specification in reinforcement learning

Martha White

TL;DR介绍了一种RL任务形式化的方法，通过简单的构造实现了不同任务设置的统一，包括状态转移折扣的概括；并拓展了标准学习构造和部分理论结果，提供了一个可理解和可靠的形式化框架来构建理论和简化算法使用和开发。

Abstract

reinforcement learning tasks are typically specified as markov decision processes. This formalism has been highly successful, though specifications often couple the dynamics of the environment and the learning ob