June 2016

Structured Stochastic Linear Bandits

Nicholas Johnson, Vidyashankar Sivakumar, Arindam Banerjee

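TL;DR: This work studies how to construct confidence ellipsoids for stochastic linear bandit problems in which the unknown parameter has structure (e.g., sparse, group sparse, or low-rank), yielding tighter confidence sets and sharper regret bounds.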

Abstract

The stochastic linear bandit problem proceeds in rounds where at each round the algorithm selects a vector from a decision set after which it receives a noisy linear loss parameterized by an unknown vector. The goal in such a problem is to minimize the (pseudo) regret which is the diff
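
The round-by-round protocol described in the abstract can be illustrated with a minimal sketch. It is not the authors' algorithm: the finite decision set, the Gaussian noise, the sparse `theta_star`, and the two placeholder policies (`uniform_policy`, `make_oracle_policy`) are assumptions introduced here for illustration. The pseudo-regret is computed under the standard convention, assumed here, of comparing the expected loss of each chosen vector against the best fixed vector in the decision set.

```python
import random


def dot(u, v):
    """Inner product of two equal-length sequences."""
    return sum(a * b for a, b in zip(u, v))


def run_linear_bandit(decision_set, theta_star, choose, rounds,
                      noise_std=0.1, seed=0):
    """Simulate the protocol and return the cumulative pseudo-regret.

    `choose(history, decision_set, rng)` is a hypothetical policy interface:
    it sees past (vector, noisy loss) pairs, never theta_star itself.
    """
    rng = random.Random(seed)
    # Expected loss of the best fixed vector (assumed regret benchmark).
    best_loss = min(dot(x, theta_star) for x in decision_set)
    history = []
    pseudo_regret = 0.0
    for _ in range(rounds):
        x = choose(history, decision_set, rng)
        # Noisy linear loss parameterized by the unknown vector.
        loss = dot(x, theta_star) + rng.gauss(0.0, noise_std)
        history.append((x, loss))
        # Pseudo-regret uses expected losses, so the noise does not enter.
        pseudo_regret += dot(x, theta_star) - best_loss
    return pseudo_regret


def uniform_policy(history, decision_set, rng):
    """Placeholder policy: ignore feedback and pick uniformly at random."""
    return rng.choice(decision_set)


def make_oracle_policy(theta_star):
    """Benchmark policy that knows theta_star (unavailable in practice)."""
    def policy(history, decision_set, rng):
        return min(decision_set, key=lambda x: dot(x, theta_star))
    return policy


# Illustrative setup (assumed): a 1-sparse parameter in 4 dimensions and a
# decision set made of the signed standard basis vectors.
theta_star = [0.0, 0.5, 0.0, 0.0]
decision_set = []
for i in range(4):
    for sign in (1, -1):
        e = [0] * 4
        e[i] = sign
        decision_set.append(e)

regret_uniform = run_linear_bandit(decision_set, theta_star,
                                   uniform_policy, rounds=50)
regret_oracle = run_linear_bandit(decision_set, theta_star,
                                  make_oracle_policy(theta_star), rounds=50)
```

In this sketch the oracle incurs zero pseudo-regret by construction, while any policy that ignores feedback accumulates regret that grows linearly with the number of rounds. A structure-aware method would sit between the two, using the sparsity of `theta_star` to narrow its confidence set more quickly.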