BriefGPT.xyz
Jan, 2022
Iterated Reasoning with Mutual Information in Cooperative and Byzantine Decentralized Teaming
Sachin Konan, Esmaeil Seraj, Matthew Gombolay
TL;DR
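This paper proposes the InfoPG algorithm, which optimizes cooperative multi-agent decision-making by maximizing mutual information, improving learning efficiency and total reward across several complex tasks.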
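As a reminder of the quantity the summary refers to, the sketch below computes the mutual information I(X;Y) between two agents' discrete actions from a joint probability table. It is not the InfoPG algorithm and is not taken from the paper; the function name `mutual_information` and the two example tables are assumptions made purely for illustration.

```python
import math


def mutual_information(joint):
    """Return I(X;Y) in nats for a joint probability table joint[x][y].

    Hypothetical helper for illustration only; not part of InfoPG.
    """
    # Marginals: sum over rows for p(x), over columns for p(y).
    px = [sum(row) for row in joint]
    py = [sum(col) for col in zip(*joint)]

    mi = 0.0
    for x, row in enumerate(joint):
        for y, pxy in enumerate(row):
            if pxy > 0:  # terms with zero probability contribute nothing
                mi += pxy * math.log(pxy / (px[x] * py[y]))
    return mi


# Two agents, two actions each (made-up distributions).
independent = [[0.25, 0.25], [0.25, 0.25]]  # actions carry no information about each other
coordinated = [[0.5, 0.0], [0.0, 0.5]]      # actions always match

print(mutual_information(independent))
print(mutual_information(coordinated))
```

The independent table yields zero mutual information, while the perfectly coordinated table yields log 2 nats, which is the sense in which higher mutual information between agents' actions indicates tighter coordination.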
Abstract
Information sharing is key in building team cognition and enables coordination and cooperation. High-performing human teams also benefit from acting strategically with hierarchical levels of iterated communication…