Jun, 2022

Generalized Beliefs for Cooperative AI

Darius Muglich, Luisa Zintgraf, Christian Schroeder de Witt, Shimon Whiteson, Jakob Foerster

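TL;DR: This work proposes a belief learning model that can decode and adapt to novel conventions at test time. Using it for search and for training a best response across various pools of policies substantially improves ad-hoc teamplay, and it also promotes the explainability and interpretability of agent conventions.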

Abstract

Self-play is a common paradigm for constructing solutions in Markov games that can yield optimal policies in collaborative settings. However, these policies often adopt highly specialized conventions that make pl