BriefGPT.xyz
May, 2023
基于奖励机制的强化学习在随机博弈中的应用
Reinforcement Learning With Reward Machines in Stochastic Games
HTML
PDF
Jueming Hu, Jean-Raphaël Gaglione, Yanze Wang, Zhe Xu, Ufuk Topcu...
TL;DR
本文探讨了利用奖励机制来实现高级任务的多智能体强化学习算法QRM-SG,能在Nash平衡下在多智能体系统中学习最优策略,并且在三个案例研究中证明了其有效性。
Abstract
We investigate
multi-agent reinforcement learning
for
stochastic games
with complex tasks, where the reward functions are non-Markovian. We utilize
→