BriefGPT.xyz
Feb, 2013
强化学习中的状态表示选择
Selecting the State-Representation in Reinforcement Learning
HTML
PDF
Odalric-Ambrym Maillard, Rémi Munos, Daniil Ryabko
TL;DR
该研究论文研究了强化学习中选择正确的状态表示问题,提出了一种算法在不知道正确模型的情况下获得尽可能多的奖励。
Abstract
The problem of selecting the right
state-representation
in a
reinforcement learning
problem is considered. Several models (functions mapping past observations to a finite set) of the observations are given, and i
→