Feb, 2019
Multi-Player Bandits: The Adversarial Case
Pragnya Alatur, Kfir Y. Levy, Andreas Krause
TL;DR
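Presents the first multi-player bandit algorithm that works in an arbitrarily changing environment, where the losses of the arms may even be chosen by an adversary. This resolves an open problem posed by Rosenski, Shamir, and Szlak (2016).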
Abstract
We consider a setting where multiple players sequentially choose among a common set of actions (arms). Motivated by a cognitive radio networks application, we assume that players incur a loss upon colliding, and …