BriefGPT.xyz
Nov, 2024
无悔学习中的福利最大化障碍
Barriers to Welfare Maximization with No-Regret Learning
HTML
PDF
Ioannis Anagnostides, Alkis Kalavasis, Tuomas Sandholm
TL;DR
本研究针对在无悔学习中如何达到均衡的问题,提出了对两人(一般总和)博弈中计算下界的首次探讨,强调了所需迭代次数。研究者通过证明计算近似最优 $T$ 稀疏协调均衡的下界,从而限制了无悔学习的迭代复杂性,并表明最大团的不可近似性阻碍了在多项式时间内实现任何非平凡稀疏化的可能性。
Abstract
A celebrated result in the interface of online learning and
Game theory
guarantees that the repeated interaction of no-regret players leads to a
Coarse correlated equilibrium
(CCE) -- a natural game-theoretic sol
→