BriefGPT.xyz
Jun, 2022
共享有限容量臂的多次随机赌博机
Multiple-Play Stochastic Bandits with Shareable Finite-Capacity Arms
HTML
PDF
Xuchuang Wang, Hong Xie, John C. S. Lui
TL;DR
研究了多臂赌博机问题中的可共享臂设置,提出了一个用于评估可共享臂容量的估计器以及一个在线学习算法,并验证了其在5G和4G基站选择中的有效性。
Abstract
We generalize the multiple-play
multi-armed bandits
(MP-MAB) problem with a
shareable arm setting
, in which several plays can share the same arm. Furthermore, each shareable arm has a finite reward capacity and a
→