Optimal Best Arm Identification with Fixed Confidence

Feb 2016

Aurélien Garivier, Emilie Kaufmann

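TL;DR: This work gives a complete characterization of the complexity of best-arm identification in one-parameter bandit problems. It proves a new, tight lower bound on the sample complexity and proposes the "Track-and-Stop" strategy, which combines a new sampling rule with a Chernoff stopping rule and is proved to be asymptotically optimal.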
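To make the stopping-rule idea in the summary above more concrete, here is a minimal sketch of a Chernoff-style (generalized likelihood ratio) stopping check. It assumes Gaussian arms with known variance, which is only one instance of a one-parameter family. The function names, the `sigma` parameter, and the threshold `log((log(t) + 1) / delta)` are illustrative assumptions, not taken from the text above, and the sampling rule is not shown.

```python
import math


def gaussian_kl(x, y, sigma=1.0):
    """KL divergence between N(x, sigma^2) and N(y, sigma^2)."""
    return (x - y) ** 2 / (2.0 * sigma ** 2)


def chernoff_statistic(means, counts, sigma=1.0):
    """Return (empirical best arm, smallest GLR statistic against any challenger).

    means[a]  : empirical mean of arm a
    counts[a] : number of draws of arm a (assumed >= 1 for every arm)
    """
    best = max(range(len(means)), key=lambda a: means[a])
    stats = []
    for b in range(len(means)):
        if b == best:
            continue
        n_a, n_b = counts[best], counts[b]
        # Count-weighted mean of the two arms: the most likely common value
        # under the alternative "arm b is at least as good as the leader".
        pooled = (n_a * means[best] + n_b * means[b]) / (n_a + n_b)
        stats.append(
            n_a * gaussian_kl(means[best], pooled, sigma)
            + n_b * gaussian_kl(means[b], pooled, sigma)
        )
    return best, min(stats)


def should_stop(means, counts, delta, sigma=1.0):
    """Stop once the statistic exceeds a threshold (heuristic, assumed here)."""
    t = sum(counts)
    threshold = math.log((math.log(t) + 1.0) / delta)  # assumption for illustration
    best, z = chernoff_statistic(means, counts, sigma)
    return z > threshold, best


if __name__ == "__main__":
    # Two arms with empirical means 1.0 and 0.0, at two sample sizes.
    print(should_stop([1.0, 0.0], [10, 10], delta=0.1))
    print(should_stop([1.0, 0.0], [100, 100], delta=0.1))
```

With the same empirical gap, the statistic grows linearly with the number of draws while the assumed threshold grows only like log log t, so the check eventually fires once enough samples separate the leader from every challenger.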

Abstract

We provide a complete characterization of the complexity of best-arm identification in one-parameter bandit problems. We prove a new, tight lower bound on the →