BriefGPT.xyz
Apr, 2018
UCBoost: A Boosting Approach to Tame Complexity and Optimality for Stochastic Bandits
Fang Liu, Sinong Wang, Swapna Buccapatnam, Ness Shroff
TL;DR
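This paper proposes UCBoost, a method for the problem of finding low-complexity yet optimal multi-armed bandit algorithms for sequential decision-making problems. The method offers a way to trade computational complexity against optimality.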
Abstract
In this work, we address the open problem of finding low-complexity near-optimal multi-armed bandit algorithms for sequential decision making problems. Existing bandit algorithms are either sub-optimal and comput