BriefGPT.xyz
Jun, 2021
连续状态空间中的样本高效强化学习:超越线性的视角
Sample Efficient Reinforcement Learning In Continuous State Spaces: A Perspective Beyond Linearity
HTML
PDF
Dhruv Malik, Aldo Pacchiano, Vishwak Srinivasan, Yuanzhi Li
TL;DR
提出了Effective Planning Window(EPW)条件,并提供一种算法来证明满足该条件的MDPs具有有效的样本使用率,该条件是在RL中不需要假设线性结构的一种结构性条件。
Abstract
reinforcement learning
(RL) is empirically successful in complex
nonlinear
Markov decision processes (MDPs) with continuous state spaces. By contrast, the majority of theoretical RL literature requires the MDP to
→