Mar, 2018
On the insufficiency of existing momentum schemes for Stochastic Optimization
Rahul Kidambi, Praneeth Netrapalli, Prateek Jain, Sham M. Kakade
TL;DR
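This paper studies the limitations and shortcomings of existing fast gradient methods in the stochastic setting by proving that simple problem instances exist and by proposing a new Nesterov-based algorithm. Experiments show that the new algorithm has advantages over commonly used methods.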
Abstract
Momentum based stochastic gradient methods such as heavy ball (HB) and Nesterov's accelerated gradient descent (NAG) method are widely used in practice for training deep networks and other supervised learning mod