BriefGPT.xyz
Apr, 2022
部分可观察的强化学习何时不可怕?
When Is Partially Observable Reinforcement Learning Not Scary?
HTML
PDF
Qinghua Liu, Alan Chung, Csaba Szepesvári, Chi Jin
TL;DR
该论文介绍了应用于部分可观测的情况下的强化学习模型,探讨了在一些特殊情况下该模型的使用,提出了一种通过乐观估计与极大似然估计相结合的简单算法,能够保证在这些特殊情况下有多项式样本复杂度可行的方法。
Abstract
Applications of
reinforcement learning
(RL), in which agents learn to make a sequence of decisions despite lacking complete information about the latent states of the controlled system, that is, they act under
partial o
→