BriefGPT.xyz
Aug, 2024
对抗攻击鲁棒的随机多臂赌博机
Stochastic Bandits Robust to Adversarial Attacks
HTML
PDF
Xuchuang Wang, Jinhang Zuo, Xutong Liu, John C. S. Lui, Mohammad Hajiesmaili
TL;DR
本文研究了对抗攻击具有鲁棒性的随机多臂赌博机算法,解决了攻击者在观察学习者行动后篡改奖励观测的问题。提出的算法在已知和未知攻击预算情况下均有效,显著降低了算法的遗憾界限,为提升算法在对抗环境中的稳定性提供了新思路。
Abstract
This paper investigates stochastic
Multi-Armed Bandit
algorithms that are robust to
Adversarial Attacks
, where an attacker can first observe the learner's action and {then} alter their reward observation. We stud
→