Jul, 2014
On the Complexity of Best Arm Identification in Multi-Armed Bandit Models
Emilie Kaufmann, Olivier Cappé, Aurélien Garivier
TL;DR
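This paper studies the performance of multi-armed bandit models and provides lower bounds and matching algorithms for specific cases. It also provides an improved sequential stopping rule and two independent technical results.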
Abstract
The stochastic multi-armed bandit model is a simple abstraction that has proven useful in many different contexts in statistics and machine learning. Whereas the achievable limit in terms of regret minimization i…