TL;DR本文研究了基于 Common Information 方法的多智能体随机控制问题,提出了一种新的算法 CHSVI 解决了协调器的 POMDP 可能出现的计算难题。
Abstract
The Common Information (CI) approach provides a systematic way to transform a multi-agent stochastic control problem to a single-agent partially observed markov decision problem (→