TL;DR本研究通过在 Lux AI v2 Kaggle 比赛中应用 RL,使用一种集中式方法来训练 RL 代理,并报告了沿途的多个设计决策,以控制多种类型的变量大小编队群,从而解决多优化问题。
Abstract
multi-agent reinforcement learning (MARL) studies the behaviour of multiple
learning agents that coexist in a shared environment. MARL is more challenging
than single-agent RL because it involves more complex learning dynamics: the
observations and rewards of each agent are functions o