BriefGPT.xyz
Oct, 2022
二次回归模型表现出稳定边缘的逐渐加强
Second-order regression models exhibit progressive sharpening to the edge of stability
HTML
PDF
Atish Agarwala, Fabian Pedregosa, Jeffrey Pennington
TL;DR
本文研究了大步长梯度下降的特性,证明二阶回归模型中存在一种逐渐趋于稳定的过程,这一过程不仅仅局限于神经网络等复杂的高维非线性模型中,这可能是一种离散学习算法。
Abstract
Recent studies of
gradient descent
with large step sizes have shown that there is often a regime with an initial increase in the largest eigenvalue of the
loss hessian
(progressive sharpening), followed by a stab
→