TL;DR本研究提出了一种名为 Heterogeneous League Training (HLT) 的通用强化学习算法,用于解决异构多智能体问题,试验结果表明 HLT 可以提高异构团队在合作任务中的成功率,是解决策略版本迭代问题的有效途径,提供了评估异构团队中每个角色难度的实际方法。
Abstract
Many multiagent systems in the real world include multiple types of agents with different abilities and functionality. Such heterogeneous multiagent systems have significant practical advantages. However, they also come with challenges compared with homogeneous systems for multiagent <