BriefGPT.xyz
Sep, 2016
(有偏) 带噪声梯度预言的凸优化
(Bandit) Convex Optimization with Biased Noisy Gradient Oracles
HTML
PDF
Xiaowei Hu, Prashanth L. A., András György, Csaba Szepesvári
TL;DR
提出了一种新的框架用抽象的预言家代替了具体的梯度估计方法,并统一了以前的工作,同时表明为了实现最优的根号-n速率,这种方法还需要进一步的研究。
Abstract
Algorithms for
bandit convex optimization
and
online learning
often rely on constructing noisy gradient estimates, which are then used in appropriately adjusted first-order algorithms, replacing actual gradients.
→