BriefGPT.xyz
Oct, 2021
何时才能高效学习具有多个玩家的广义和马尔可夫博弈?
When Can We Learn General-Sum Markov Games with a Large Number of Players Sample-Efficiently?
HTML
PDF
Ziang Song, Song Mei, Yu Bai
TL;DR
本文探讨了多人博弈中学习的样本复杂性问题, 并设计算法在样本复杂度多项式级别下, 求解多人一般和马尔可夫博弈的粗略关联均衡和关联均衡, 同时提出了针对特定条件下的学习算法, 显著提高了现有算法的效率和精度。
Abstract
multi-agent reinforcement learning
has made substantial empirical progresses in solving games with a large number of players. However, theoretically, the best known
sample complexity
for finding a
→