BriefGPT.xyz
Jun, 2017
稀疏随机赌博机
Sparse Stochastic Bandits
HTML
PDF
Joon Kwon, Vianney Perchet, Claire Vernade
TL;DR
本文研究了经典多臂老虎机问题的稀疏情况,并提出了一种算法,其遗憾值与臂数的正比例关系被缩小至仅与正收益臂数相同,同时证明了其最优性。
Abstract
In the classical
multi-armed bandit problem
, d arms are available to the decision maker who pulls them sequentially in order to maximize his cumulative reward. Guarantees can be obtained on a relative quantity called
re
→