针对对抗性线性情境赌博机的高效稳健算法

Feb, 2020

针对对抗性线性情境赌博机的高效稳健算法

Efficient and Robust Algorithms for Adversarial Linear Contextual Bandits

Gergely Neu, Julia Olkhovskaya

TL;DR针对经典$K$-armed线性上下文对抗性问题，我们开发了基于Exp3算法的计算有效算法，其中包含实时算法和鲁棒算法，它们能够实现良好的失望保证，并且对于线性奖励函数而言具有稳健性。

Abstract

We consider an adversarial variant of the classic $K$-armed linear contextual bandit problem where the sequence of loss functions associated with each arm are allowed to change without restriction over time. Under the assumption that the $d$-dimensional contexts are generated i.i.d.~at random from a known distributions, we develop →