BriefGPT.xyz
Jun, 2021
马尔可夫潜在博弈中多智能体策略梯度的全局收敛
Global Convergence of Multi-Agent Policy Gradient in Markov Potential Games
HTML
PDF
Stefanos Leonardos, Will Overman, Ioannis Panageas, Georgios Piliouras
TL;DR
本研究提出了一种新的马尔可夫潜势博弈(MPG)的定义,用于捕捉复杂的多智能体协调。结果表明,独立策略梯度可以快速收敛到纳什均衡策略。
Abstract
potential games
are arguably one of the most important and widely studied classes of normal form games. They define the archetypal setting of
multi-agent coordination
as all agent utilities are perfectly aligned
→