BriefGPT.xyz
Jun, 2019
Randomized Exploration in Generalized Linear Bandits
Branislav Kveton, Manzil Zaheer, Csaba Szepesvari, Lihong Li, Mohammad Ghavamzadeh...
TL;DR
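This paper studies two randomized algorithms for generalized linear bandits, GLM-TSL and GLM-FPL, and provides $\tilde{O}(d\sqrt{n \log K})$ regret guarantees for both. The two algorithms perform well in logistic regression and neural network experiments and are markedly faster.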
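To make the scaling of the $\tilde{O}(d\sqrt{n \log K})$ guarantee concrete, the sketch below evaluates only its leading term. It assumes the usual reading of the symbols, which the summary itself does not define: $d$ as the number of features, $n$ as the number of rounds, and $K$ as the number of arms. The function name `regret_bound_leading_term` and the sample values are hypothetical, and constants and the logarithmic factors hidden by $\tilde{O}$ are ignored, so only the ratios are meaningful.

```python
import math


def regret_bound_leading_term(d: int, n: int, K: int) -> float:
    """Leading term d * sqrt(n * log K) of the stated regret bound.

    Assumption: d = number of features, n = number of rounds, K = number of arms.
    Constants and the log factors hidden by the O-tilde notation are dropped.
    """
    if d < 1 or n < 1 or K < 2:
        raise ValueError("need d >= 1, n >= 1, and K >= 2")
    return d * math.sqrt(n * math.log(K))


# Hypothetical baseline problem size, chosen only for illustration.
base = regret_bound_leading_term(d=10, n=10_000, K=100)

# Change one quantity at a time and compare against the baseline.
ratio_rounds = regret_bound_leading_term(d=10, n=40_000, K=100) / base    # 4x rounds
ratio_features = regret_bound_leading_term(d=20, n=10_000, K=100) / base  # 2x features
ratio_arms = regret_bound_leading_term(d=10, n=10_000, K=10_000) / base   # K squared

print(f"4x rounds   -> bound grows by {ratio_rounds:.3f}x")
print(f"2x features -> bound grows by {ratio_features:.3f}x")
print(f"K squared   -> bound grows by {ratio_arms:.3f}x")
```

Under these assumptions the leading term is linear in $d$, grows with the square root of $n$ (so the per-round regret shrinks as $n$ increases), and depends on $K$ only through $\sqrt{\log K}$, which is why squaring the number of arms raises it by just a factor of $\sqrt{2}$.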
Abstract
We study two randomized algorithms for generalized linear bandits, GLM-TSL and GLM-FPL.