BriefGPT.xyz
Dec, 2024
马尔可夫博弈的近似状态抽象
Approximate State Abstraction for Markov Games
HTML
PDF
Hiroki Ishibashi, Kenshi Abe, Atsushi Iwasaki
TL;DR
本文解决了两人零和马尔可夫博弈(TZMGs)中由于状态数量增加导致均衡计算困难的问题。通过将多个不同状态视为一个状态的方式进行状态抽象,提出了一种新颖的方法,并通过推导对偶间隙界限来评估状态抽象游戏的均衡解。实验结果显示,该方法在马尔可夫足球游戏中有效计算了均衡策略,具有重要的应用潜力。
Abstract
This paper introduces
State Abstraction
for two-player zero-sum
Markov Games
(TZMGs), where the payoffs for the two players are determined by the state representing the environment and their respective actions, w
→