Incremental Model-based Learners With Formal Learning-Time Guarantees

June 2012

Alexander L. Strehl, Lihong Li, Michael L. Littman

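TL;DR: This work studies the use of real-time dynamic programming to speed up model-based learning algorithms, improving computational efficiency when solving Markov decision problems with finite state and action spaces, and proves that the two resulting algorithms are efficient in the PAC sense.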
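As a rough illustration of the idea summarized above, the sketch below keeps an empirical model of a finite MDP and, after each observed transition, performs a single Bellman backup on the visited state-action pair instead of re-solving the whole model. It is not the authors' algorithms: the class name `IncrementalModelLearner`, the visit threshold `m`, the reward bound `r_max`, and the optimistic initialization are all assumptions made for this example.

```python
from collections import defaultdict


class IncrementalModelLearner:
    """Hypothetical sketch: empirical model plus one backup per time step."""

    def __init__(self, n_states, n_actions, gamma=0.95, m=5, r_max=1.0):
        self.n_actions = n_actions
        self.gamma = gamma
        self.m = m  # assumed number of samples before a pair counts as "known"
        v_max = r_max / (1.0 - gamma)
        # Optimistic initial values (an assumption) encourage exploration.
        self.Q = [[v_max] * n_actions for _ in range(n_states)]
        self.count = [[0] * n_actions for _ in range(n_states)]
        self.reward_sum = [[0.0] * n_actions for _ in range(n_states)]
        self.next_count = [
            [defaultdict(int) for _ in range(n_actions)] for _ in range(n_states)
        ]

    def act(self, s):
        # Greedy action with respect to the current value estimates.
        return max(range(self.n_actions), key=lambda a: self.Q[s][a])

    def observe(self, s, a, r, s_next):
        # Update the empirical model until the pair has m samples.
        if self.count[s][a] < self.m:
            self.count[s][a] += 1
            self.reward_sum[s][a] += r
            self.next_count[s][a][s_next] += 1
        self.backup(s, a)

    def backup(self, s, a):
        # One Bellman backup on the visited pair only; no full planning sweep.
        n = self.count[s][a]
        if n < self.m:
            return  # not enough data yet: keep the optimistic value
        r_hat = self.reward_sum[s][a] / n
        expected_v = sum(
            (c / n) * max(self.Q[s2]) for s2, c in self.next_count[s][a].items()
        )
        self.Q[s][a] = r_hat + self.gamma * expected_v
```

Each call to `observe` costs time proportional to the number of observed successor states times the number of actions, whereas re-solving the learned model after every step would touch every state. That gap is the kind of computational saving the summary refers to.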

Abstract

Model-based learning algorithms have been shown to use experience efficiently when learning to solve Markov decision processes (MDPs) with finite state and action spaces. However, their high computational cost du