BriefGPT.xyz
Feb, 2022
强化学习中的计算统计差距
Computational-Statistical Gaps in Reinforcement Learning
HTML
PDF
Daniel Kane, Sihan Liu, Shachar Lovett, Gaurav Mahajan
TL;DR
本文针对强化学习中的大状态空间问题,研究使用函数逼近的强化学习方法,并提出了寻找高效率算法的方案,同时探讨了计算难度与统计问题之间的关系。
Abstract
reinforcement learning
with
function approximation
has recently achieved tremendous results in applications with large state spaces. This empirical success has motivated a growing body of theoretical work proposi
→