BriefGPT.xyz
May, 2018
基于策略梯度的可扩展集中化深度多智体强化学习
Scalable Centralized Deep Multi-Agent Reinforcement Learning via Policy Gradients
HTML
PDF
Arbaaz Khan, Clark Zhang, Daniel D. Lee, Vijay Kumar, Alejandro Ribeiro
TL;DR
探索使用强化学习解决多智能体问题,将多智能体强化学习问题视为分布式优化问题处理,假设多智能体群体中每个智能体的策略在参数空间中相近且可以用单一策略代替,结果表明该算法在协作和竞争任务上比现有方法更加有效。
Abstract
In this paper, we explore using
deep reinforcement learning
for problems with multiple agents. Most existing methods for deep
multi-agent
reinforcement learning consider only a small number of agents. When the nu
→