Aug, 2023
合作多智能体赌博机:优化个体懊悔并具有恒定通讯开销的分布式算法
Cooperative Multi-agent Bandits: Distributed Algorithms with Optimal Individual Regret and Constant Communication Costs
Lin Yang, Xuchuang Wang, Mohammad Hajiesmaili, Lijun Zhang, John C.S. Lui...
TL;DR合作多智能体多臂赌博算法中的通信策略,既实现了最优个体遗憾,又具有恒定的通信成本。