BriefGPT.xyz
Feb, 2023
随机博弈的高效Q学习
Efficient-Q Learning for Stochastic Games
HTML
PDF
Muhammed O. Sayin, Onur Unlu
TL;DR
本文提出了新的高效Q学习动态应用于随机博弈,使智能体能够遵循阶段游戏中的对数线性学习动态,通过逐步迭代估计Q函数,实现高效平衡,并通过逐渐减小步长的方式使其收敛,同时还研究了 softmax 响应在此过程中产生的近似误差。
Abstract
We present the new efficient-Q learning dynamics for
stochastic games
beyond the recent concentration of progress on provable convergence to possibly inefficient equilibrium. We let agents follow the
log-linear learning
→