BriefGPT.xyz
Aug, 2023
On Uniformly Optimal Algorithms for Best Arm Identification in Two-Armed Bandits with Fixed Budget
Po-An Wang, Kaito Ariu, Alexandre Proutiere
TL;DR
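For best-arm identification in stochastic two-armed bandits with a fixed budget, no algorithm outperforms uniform sampling. The argument introduces a natural class of "consistent and stable" algorithms and proves that algorithms in this class cannot achieve a lower error rate than uniform sampling.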
Abstract
We study the problem of best-arm identification with fixed budget in stochastic two-arm bandits with Bernoulli rewards. We prove that surp…
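To make the baseline concrete, here is a minimal Python sketch of uniform sampling in the setting the abstract describes: two Bernoulli arms and a fixed budget of pulls. It is an illustration, not the paper's code. The function names (`uniform_sampling_bai`, `estimate_error_rate`), the tie-breaking rule, and the Monte Carlo setup are all assumptions made for this example.

```python
import random


def uniform_sampling_bai(means, budget, rng):
    """Pull each of the two Bernoulli arms budget // 2 times and
    return the index of the arm with the higher empirical mean."""
    pulls = budget // 2
    # Count successes per arm; each pull is a Bernoulli(mu) draw.
    successes = [sum(rng.random() < mu for _ in range(pulls)) for mu in means]
    # Ties go to arm 0 (an arbitrary choice made for this sketch).
    return 0 if successes[0] >= successes[1] else 1


def estimate_error_rate(means, budget, trials=2000, seed=0):
    """Monte Carlo estimate of the probability of returning the wrong arm."""
    rng = random.Random(seed)
    best = 0 if means[0] >= means[1] else 1
    errors = sum(
        uniform_sampling_bai(means, budget, rng) != best for _ in range(trials)
    )
    return errors / trials


if __name__ == "__main__":
    # The estimated error rate should shrink as the budget grows.
    for budget in (20, 100, 400):
        print(budget, estimate_error_rate((0.6, 0.4), budget))
```

The sketch splits the budget evenly and never adapts to observed rewards; the result summarized above says that, in this two-armed Bernoulli setting, no algorithm can match this rule on every instance and also strictly beat it on some instance.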