BriefGPT.xyz
Feb, 2019
更好的随机赌博机算法与对抗性干扰
Better Algorithms for Stochastic Bandits with Adversarial Corruptions
HTML
PDF
Anupam Gupta, Tomer Koren, Kunal Talwar
TL;DR
研究了存在对抗性污染的随机多臂赌博机问题,在此问题上提出了一种新算法,其遗憾几乎是最优的,相对于以前的工作有显著的改进。我们的算法对对抗污染的程度是不可知的,并且可以承受相当大的污染,几乎不会降低性能。
Abstract
We study the
stochastic multi-armed bandits
problem in the presence of
adversarial corruption
. We present a
new algorithm
for this problem
→