BriefGPT.xyz
May, 2023
有限精度采样赌博机中的最佳臂识别
Best Arm Identification in Bandits with Limited Precision Sampling
HTML
PDF
Kota Srinivas Reddy, P. N. Karthik, Nikhil Karamchandani, Jayakrishnan Nair
TL;DR
研究了多臂赌博机问题中学习者在选择臂时精度受限的变体,并且给出了期望停留时间的渐近下限并提出了一种修改后的算法用于处理非唯一最优配置,并且针对在简单的情况下访问不重叠臂的情况给出了非渐近下限和上限。
Abstract
We study best arm identification in a variant of the
multi-armed bandit problem
where the learner has
limited precision
in arm selection. The learner can only sample arms via certain
→