BriefGPT.xyz
Jul, 2022
具丰富行动集的线性赌博机探索及其对推断的影响
Exploration in Linear Bandits with Rich Action Sets and its Implications for Inference
HTML
PDF
Debangshu Banerjee, Avishek Ghosh, Sayak Ray Chowdhury, Aditya Gopalan
TL;DR
本研究给出了一个关于线性奖励算法设计矩阵特征光谱的非渐进下界,以及它对模型选择和聚类的应用。
Abstract
We present a non-asymptotic lower bound on the
eigenspectrum
of the design matrix generated by any
linear bandit algorithm
with sub-linear regret when the action set has
→