BriefGPT.xyz
Dec, 2015
Multi-Player Bandits -- a Musical Chairs Approach
Jonathan Rosenski, Ohad Shamir, Liran Szlak
TL;DR
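This work proposes two communication-free algorithms, Musical Chairs and Dynamic Musical Chairs, for the multi-player multi-armed bandit problem, in which players who choose the same arm collide and receive no reward. The algorithms attain constant and sublinear regret, require no prior knowledge, and provide theoretical guarantees for this class of problems.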
Abstract
We consider a variant of the stochastic multi-armed bandit problem, where multiple players simultaneously choose from the same set of arms and may collide, receiving no reward. This setting has been motivated by problems arising in