BriefGPT.xyz
Jul, 2020
抗对抗攻击的随机线性臂机算法
Stochastic Linear Bandits Robust to Adversarial Attacks
HTML
PDF
Ilija Bogunovic, Arpan Losalka, Andreas Krause, Jonathan Scarlett
TL;DR
研究了随机线性赌博机问题,考虑了对抗攻击,提出了两种Robust Phased Elimination算法,证明了在非污染情况下可以获得近似最优的收益,并得出针对这些算法的相对近似最优的加性项。同时,在具有多样化情境的情况下,表明一种简单的贪婪算法是稳健的,近似最优的加性遗憾项,尽管不进行明确的探索并且不知道C。
Abstract
We consider a
stochastic linear bandit problem
in which the rewards are not only subject to random noise, but also
adversarial attacks
subject to a suitable budget $C$ (i.e., an upper bound on the sum of corrupti
→