强化学习中的状态表示选择

Feb, 2013

Selecting the State-Representation in Reinforcement Learning

Odalric-Ambrym Maillard, Rémi Munos, Daniil Ryabko

TL;DR该研究论文研究了强化学习中选择正确的状态表示问题,提出了一种算法在不知道正确模型的情况下获得尽可能多的奖励。

Abstract

The problem of selecting the right state-representation in a reinforcement learning problem is considered. Several models (functions mapping past observations to a finite set) of the observations are given, and i