BriefGPT.xyz
Jun, 2017
Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments
Ryan Lowe, Yi Wu, Aviv Tamar, Jean Harb, Pieter Abbeel...
TL;DR
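This paper studies deep reinforcement learning in multi-agent domains. It proposes an adaptation of actor-critic methods that successfully learns complex policies requiring multi-agent coordination, and it trains with an ensemble of policies for each agent to obtain stronger, more robust policies. In both cooperative and competitive scenarios, the method discovers a variety of physical and informational coordination strategies that existing methods do not.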
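The idea of training with an ensemble of policies per agent can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the function name `sample_ensemble`, the uniform per-episode choice of sub-policy, and the agent and ensemble sizes are all assumptions made for the example.

```python
import random


def sample_ensemble(num_agents, ensemble_size, rng):
    """Return one sub-policy index per agent for the coming episode.

    Assumption: each agent holds `ensemble_size` sub-policies and one is
    drawn uniformly at random per episode. The names are hypothetical.
    """
    return [rng.randrange(ensemble_size) for _ in range(num_agents)]


if __name__ == "__main__":
    rng = random.Random(0)  # fixed seed so repeated runs pick the same indices
    for episode in range(3):
        active = sample_ensemble(num_agents=3, ensemble_size=4, rng=rng)
        # Each agent would act (and be updated) with its selected sub-policy
        # for this episode, so opponents face varied behaviour across episodes.
        print(f"episode {episode}: active sub-policy per agent = {active}")
```

Varying which sub-policy each agent runs is one plausible way to keep any single agent from overfitting to the fixed behaviour of the others, which matches the robustness goal stated in the summary.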
Abstract
We explore deep reinforcement learning methods for multi-agent domains. We begin by analyzing the difficulty of traditional algorithms in the multi-agent case: Q-learning is challenged by an inherent non-stationa…