BriefGPT.xyz
May, 2022
马尔可夫潜在博弈中的独立和去中心化学习
Independent and Decentralized Learning in Markov Potential Games
HTML
PDF
Chinmay Maheshwari, Manxi Wu, Druv Pai, Shankar Sastry
TL;DR
该论文提出了一种多智能体强化学习动态模型,分析了其在无限期贴现马尔可夫潜在博弈中的收敛性质。论文在独立和分散的环境下进行,重点研究了多智能体可以通过简单的学习动态方法在最小信息环境下达到马尔可夫潜在博弈的稳定纳什均衡。
Abstract
We propose a
multi-agent reinforcement learning
dynamics, and analyze its convergence properties in infinite-horizon discounted
markov potential games
. We focus on the independent and
→