BriefGPT.xyz
Oct, 2023
From Stability to Chaos: Analyzing Gradient Descent Dynamics in Quadratic Regression
Xuxing Chen, Krishnakumar Balasubramanian, Promit Ghosal, Bhavya Agrawalla
TL;DR
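Through a comprehensive investigation of large-step-size gradient descent in quadratic regression models, the paper shows that the dynamics can be described by a specific cubic map, delineates five distinct training phases via a detailed bifurcation analysis, and studies generalization performance in the non-monotonic and non-divergent phases.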
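To make the "cubic map" idea concrete, here is a minimal sketch on a toy problem. It is an assumption for illustration, not the paper's model or its exact map: a scalar parameter `w`, a model output `w**2`, a target of 1, and the loss `(w**2 - 1)**2 / 4`. The gradient is `w * (w**2 - 1)`, so one gradient-descent step is a cubic polynomial in `w`. The names `gd_step`, `run_gd`, `loss` and the chosen step sizes are hypothetical, and the thresholds seen here apply only to this toy.

```python
def loss(w):
    """Toy loss (an assumption, not the paper's model): fit w**2 to the target 1."""
    return 0.25 * (w * w - 1.0) ** 2


def gd_step(w, eta):
    """One gradient-descent step; the update is a cubic map in w."""
    grad = w * (w * w - 1.0)
    return w - eta * grad


def run_gd(w0, eta, steps, max_abs=1e6):
    """Iterate the cubic map, stopping early once |w| exceeds max_abs."""
    traj = [w0]
    w = w0
    for _ in range(steps):
        w = gd_step(w, eta)
        traj.append(w)
        if abs(w) > max_abs:
            break  # treat as divergence and avoid float overflow
    return traj


if __name__ == "__main__":
    for eta in (0.1, 1.2, 3.0):
        traj = run_gd(0.5, eta, steps=200)
        print(f"eta={eta}: last w={traj[-1]:.4g}, last loss={loss(traj[-1]):.4g}")
```

In this toy, the fixed point `w = 1` has map derivative `1 - 2 * eta`, so it is attracting for `eta < 1` and repelling beyond that. A small step size converges monotonically, a moderately large one keeps the iterates bounded while the loss goes up and down, and a very large one diverges. This only hints at the kind of phase structure the summary mentions; it does not reproduce the paper's five phases.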
Abstract
We conduct a comprehensive investigation into the dynamics of gradient descent using large-order constant step-sizes in the context of quadratic regression models. Within this framework, we reveal that the dynami…