BriefGPT.xyz
Feb, 2023
马尔科夫博弈中的离线学习和一般函数逼近
Offline Learning in Markov Games with General Function Approximation
HTML
PDF
Yuheng Zhang, Yu Bai, Nan Jiang
TL;DR
研究离线多智体强化学习在马尔科夫博弈中学习近似均衡,提供适用于一般函数逼近的新框架以处理所有三种均衡,此框架利用 Bellman 一致压缩和数据覆盖条件,与之前的算法框架相比,其保证更好且能够处理更广泛的情况。
Abstract
We study offline
multi-agent reinforcement learning
(RL) in
markov games
, where the goal is to learn an approximate equilibrium -- such as Nash equilibrium and (Coarse) Correlated Equilibrium -- from an offline d
→