BriefGPT.xyz
Jan, 2024
Emergent Dominance Hierarchies in Reinforcement Learning Agents
Ram Rachum, Yonatan Nakar, Bill Tomlinson, Nitay Alon, Reuth Mirsky
TL;DR
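Modern reinforcement learning algorithms can outperform humans in a wide variety of tasks. This paper studies a fundamental social convention in multi-agent reinforcement learning settings: the dominance hierarchy. Using populations of AI agents with no explicit programming or intrinsic rewards, we show that the agents can invent, learn, enforce, and transmit dominance hierarchies whose structure resembles that studied in chickens, mice, fish, and other species.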
Abstract
Modern reinforcement learning (RL) algorithms are able to outperform humans in a wide variety of tasks. Multi-agent reinforcement learning (MARL) settings present additional challenges, and successful cooperation …