BriefGPT.xyz
May, 2023
游戏学习对学习者是否有益?
Is Learning in Games Good for the Learners?
HTML
PDF
William Brown, Jon Schneider, Kiran Vodrahalli
TL;DR
研究了两个智能体在重复对局中报酬和悔恨之间的权衡,提出了一种广义均衡概念,讨论了不同对手情况下的最优战略和可行方案,探究了利用这种广义均衡学习最优策略的方法。
Abstract
We consider a number of questions related to tradeoffs between
reward
and
regret
in repeated
gameplay
between two agents. To facilitate th
→