BriefGPT.xyz
Jan, 2019
深度 Q 学习的理论分析
A Theoretical Analysis of Deep Q-Learning
HTML
PDF
Zhuora Yang, Yuchen Xie, Zhaoran Wang
TL;DR
本论文从算法和统计角度出发,对深度强化学习中的深度Q网络算法进行了理论分析,并给出了收敛速率。作者还提出了Minimax-DQN算法,并将其与马尔可夫博弈的Nash均衡进行收敛速率的比较。
Abstract
Despite the great empirical success of
deep reinforcement learning
, its
theoretical foundation
is less well understood. In this work, we make the first attempt to theoretically understand the
→