通信对非合作式多玩家多臂赌博问题的影响

Nov, 2017

The Effect of Communication on Noncooperative Multiplayer Multi-Armed Bandit Problems

Noyan Evirgen, Alper Kose

TL;DR本研究考虑了多个玩家之间，通过Erdos-Renyi图，以不同的通信概率下的去中心化随机多臂赌博问题，使用UCB1、epsilon-Greedy和Thompson Sampling算法探究了玩家之间的连接度对累计遗憾的影响。

Abstract

We consider decentralized stochastic multi-armed bandit problem with multiple players in the case of different communication probabilities