BriefGPT.xyz
Jun, 2023
Hierarchical Task Network Planning for Facilitating Cooperative Multi-Agent Reinforcement Learning
Xuechen Mu, Hankz Hankui Zhuo, Chen Chen, Kai Zhang, Chao Yu...
TL;DR
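This paper proposes the SOMARL framework, which embeds symbolic knowledge into MARL environments through an HTN and a meta-controller. On the two benchmarks FindTreasure and MoveBox, it outperforms existing state-of-the-art methods and subgoal-based baselines.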
Abstract
Exploring sparse reward multi-agent reinforcement learning (MARL) environments with traps in a collaborative manner is a complex task. Agents typically fail to reach the goal state and fall into...