Jun, 2022
Generalized Beliefs for Cooperative AI
Darius Muglich, Luisa Zintgraf, Christian Schroeder de Witt, Shimon Whiteson, Jakob Foerster
TL;DR
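This work proposes a policy learning model based on the belief space that can decode and adapt to novel conventions at test time. This substantially improves both search and the training of responses across various policy pools, and makes agent conventions more interpretable and explainable.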
Abstract
Self-play is a common paradigm for constructing solutions in Markov games that can yield optimal policies in collaborative settings. However, these policies often adopt highly-specialized conventions that make pl
→