BriefGPT.xyz
Jul, 2023
VITS:上下文推测中的变分推断汤姆逊采样
VITS : Variational Inference Thomson Sampling for contextual bandits
HTML
PDF
Pierre Clavier, Tom Huix, Alain Durmus
TL;DR
该论文介绍和分析了一种上下文赌博问题的变体的汤普森采样(TS)算法,提出了一种基于高斯变分推理的新算法 VITS,并通过实验展示了其在合成和真实世界数据集上的有效性。
Abstract
In this paper, we introduce and analyze a variant of the
thompson sampling
(TS) algorithm for
contextual bandits
. At each round, traditional TS requires samples from the current posterior distribution, which is u
→