多人赌博机: 对抗情形

Feb, 2019

Multi-Player Bandits: The Adversarial Case

Pragnya Alatur, Kfir Y. Levy, Andreas Krause

TL;DR设计了第一个能够在任意变化的环境中工作的多人赌博算法，其中武器的损失甚至可能是由对手选择的，同时解决了Rosenski、Shamir和Szlak（2016年）提出的一个悬而未决的问题。

Abstract

We consider a setting where multiple players sequentially choose among a common set of actions (arms). Motivated by a cognitive radio networks application, we assume that players incur a loss upon colliding, and