BriefGPT.xyz
Nov, 2017
行为受限零和博弈中的遗憾最小化
Regret Minimization in Behaviorally-Constrained Zero-Sum Games
HTML
PDF
Gabriele Farina, Christian Kroer, Tuomas Sandholm
TL;DR
使用 CFR 框架开发算法以解决行为约束的 extensive-form games,同时计算近似 Nash 平衡改进。比标准算法更好,收敛速率与最先进的 Nash 平衡计算算法相当。
Abstract
no-regret learning
has emerged as a powerful tool for solving
extensive-form games
. This was facilitated by the
counterfactual-regret minimizatio
→