BriefGPT.xyz
Aug, 2019
解决折扣随机二人博弈问题的近似最优时间和样本复杂度
Solving Discounted Stochastic Two-Player Games with Near-Optimal Time and Sample Complexity
HTML
PDF
Aaron Sidford, Mengdi Wang, Lin F. Yang, Yinyu Ye
TL;DR
本文研究解决折扣二人零和随机游戏的采样复杂度,并提出一种计算策略的算法以及推广现有的MDP算法到多代理设置。
Abstract
In this paper, we settle the
sampling complexity
of solving discounted two-player turn-based zero-sum
stochastic games
up to polylogarithmic factors. Given a stochastic game with discount factor $\gamma\in(0,1)$
→