BriefGPT.xyz
Nov, 2021
组合半匪难度分析及Thompson抽样策略与贪心算法的应用
The Hardness Analysis of Thompson Sampling for Combinatorial Semi-bandits with Greedy Oracle
HTML
PDF
Fang Kong, Yueran Yang, Wei Chen, Shuai Li
TL;DR
本文研究了在贪心算法下Thompson sampling(TS)在组合多臂赌博问题(CMAB)中的行为,证明了TS可以在近似算法的预测下解决CMAB问题,并提供了渐近匹配的后悔上限。
Abstract
thompson sampling
(TS) has attracted a lot of interest in the bandit area. It was introduced in the 1930s but has not been theoretically proven until recent years. All of its analysis in the
combinatorial multi-armed ba
→