BriefGPT.xyz
Nov, 2020
随机不确定社交偏好中的紧急互惠和团队形成
Emergent Reciprocity and Team Formation from Randomized Uncertain Social Preferences
HTML
PDF
Bowen Baker
TL;DR
该研究通过引入随机不确定社交偏好(RUSP)的环境增强来训练多智能体以解决社交困境,证明了直接互惠、间接互惠与声誉的自然出现,包括团队形成,这些行为可带来更高的社会福利均衡。
Abstract
multi-agent reinforcement learning
(MARL) has shown recent success in increasingly complex fixed-team zero-sum environments. However, the real world is not zero-sum nor does it have fixed teams; humans face numerous
soc
→