BriefGPT.xyz
Jun, 2015
随机梯度下降及其异步变体中的方差降低
On Variance Reduction in Stochastic Gradient Descent and its Asynchronous Variants
HTML
PDF
Sashank J. Reddi, Ahmed Hefny, Suvrit Sra, Barnabás Póczos, Alex Smola
TL;DR
该研究探讨了基于方差缩减的优化算法,尤其是异步版本的SVRG和SAGA在机器学习中的应用和实验表现。研究结果表明,该方法在稀疏设置下实现了近线性加速。
Abstract
We study
optimization algorithms
based on
variance reduction
for
stochastic gradient descent
(SGD). Remarkable recent progress has been ma
→