BriefGPT.xyz
May, 2023
Semantically Aligned Task Decomposition in Multi-Agent Reinforcement Learning
Wenhao Li, Dan Qiao, Baoxiang Wang, Xiangfeng Wang, Bo Jin...
TL;DR
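This study proposes SAMA, a new decision-making approach to task decomposition and subgoal assignment. SAMA uses a pretrained language model, combined with language-grounded reinforcement learning, to train subgoal-conditioned policies. Compared with existing ASG methods, SAMA achieves higher sample efficiency.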
Abstract
The difficulty of appropriately assigning credit is particularly heightened in cooperative MARL with sparse reward, due to the concurrent time and structural scales involved. Automatic subgoal generation (ASG) ha