BriefGPT.xyz
Jun, 2023
SGD的精确均方线性稳定性分析
Exact Mean Square Linear Stability Analysis for SGD
HTML
PDF
Rotem Mulayoff, Tomer Michaeli
TL;DR
本文推导出了随机梯度下降法 (SGD)的稳定性阈值的显式表达式,并给出了与批量大小相关的最简单的必要稳定性条件。
Abstract
The dynamical stability of
optimization methods
at the vicinity of minima of the loss has recently attracted significant attention. For
gradient descent
(GD), stable convergence is possible only to minima that ar
→