BriefGPT.xyz
May, 2020
Breaking the Sample Size Barrier in Model-Based Reinforcement Learning with a Generative Model
Gen Li, Yuting Wei, Yuejie Chi, Yuantao Gu, Yuxin Chen
TL;DR
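This work studies the sample efficiency of reinforcement learning and proves that two algorithms are minimax optimal, attaining the minimax-optimal sample complexity for a given target accuracy. It is the first result to provide minimax-optimal guarantees that cover the entire range of sample sizes.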
Abstract
We investigate the sample efficiency of reinforcement learning in a $\gamma$-discounted infinite-horizon Markov decision process (MDP) with state space $\mathcal{S}$ and action space $\mathcal{A}$, assuming access …
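To make the setting in the title concrete, here is a minimal sketch of generic model-based planning with a generative model: draw a fixed number of next-state samples for every state-action pair, build the empirical transition kernel, and run value iteration on the resulting empirical MDP. This is the textbook plug-in approach and is not claimed to be either of the two algorithms analyzed in the paper. The toy two-state MDP and all names (`sample_next_state`, `estimate_model`, `value_iteration`, `n_samples`) are hypothetical and chosen only for illustration.

```python
import random


def sample_next_state(P, s, a, rng):
    """Hypothetical generative model: draw s' ~ P(. | s, a)."""
    return rng.choices(range(len(P[s][a])), weights=P[s][a])[0]


def estimate_model(P, n_samples, rng):
    """Build an empirical kernel from n_samples draws per (s, a) pair."""
    S, A = len(P), len(P[0])
    P_hat = [[[0.0] * S for _ in range(A)] for _ in range(S)]
    for s in range(S):
        for a in range(A):
            for _ in range(n_samples):
                s_next = sample_next_state(P, s, a, rng)
                P_hat[s][a][s_next] += 1.0 / n_samples
    return P_hat


def value_iteration(P, r, gamma, tol=1e-8):
    """Plan in the MDP (P, r); return a value estimate and a greedy policy."""
    S, A = len(P), len(P[0])
    V = [0.0] * S
    while True:
        # One Bellman optimality backup for every (s, a) pair.
        Q = [[r[s][a] + gamma * sum(P[s][a][t] * V[t] for t in range(S))
              for a in range(A)] for s in range(S)]
        V_new = [max(q) for q in Q]
        if max(abs(x - y) for x, y in zip(V_new, V)) < tol:
            policy = [max(range(A), key=lambda a: Q[s][a]) for s in range(S)]
            return V_new, policy
        V = V_new


# Toy MDP (an assumption for illustration): 2 states, 2 actions.
# P_true[s][a][s'] is the transition probability; reward is 1 in state 1.
P_true = [[[0.9, 0.1], [0.2, 0.8]],
          [[0.7, 0.3], [0.1, 0.9]]]
reward = [[0.0, 0.0], [1.0, 1.0]]
gamma = 0.9

rng = random.Random(0)
P_hat = estimate_model(P_true, n_samples=2000, rng=rng)
V_hat, policy_hat = value_iteration(P_hat, reward, gamma)
V_true, policy_true = value_iteration(P_true, reward, gamma)
print("policy from empirical model:", policy_hat)
print("policy from true model:     ", policy_true)
```

In this sketch the total number of calls to the generative model is `n_samples` times the number of state-action pairs. How small that budget can be while the planned policy still reaches a target accuracy is the sample-complexity question the paper addresses.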