BriefGPT.xyz
Jul, 2016
基于核的赌博式凸优化方法
Kernel-based methods for bandit convex optimization
HTML
PDF
Sébastien Bubeck, Ronen Eldan, Yin Tat Lee
TL;DR
提出一种新的算法解决在无导数情况下的$adversarial convex bandit$ 问题,其包含了核方法、伯努利卷积的一般化和新的退火时间表。这个算法在要求多次迭代的场景中可以取得佳效果。
Abstract
We consider the
adversarial convex bandit problem
and we build the first $\mathrm{poly}(T)$-time algorithm with $\mathrm{poly}(n) \sqrt{T}$-
regret
for this problem. To do so we introduce three new ideas in the
→