BriefGPT.xyz
Feb, 2018
VR-SGD: 一种简单的随机方差缩减机器学习方法
VR-SGD: A Simple Stochastic Variance Reduction Method for Machine Learning
HTML
PDF
Fanhua Shang, Kaiwen Zhou, James Cheng, Ivor W. Tsang, Lijun Zhang...
TL;DR
本文提出了一种名为VR-SGD的变体随机梯度下降法,其使用平均值和上一个时期的最后迭代作为两个向量,能够直接解决非光滑和/或非强凸问题,并能够使用更大的学习率。此方法在解决各种机器学习问题,如凸和非凸的经验风险最小化以及特征值计算等方面,具有更快的收敛速度。
Abstract
In this paper, we propose a simple variant of the original SVRG, called variance reduced
stochastic gradient descent
(
vr-sgd
). Unlike the choices of snapshot and starting points in SVRG and its proximal variant,
→