BriefGPT.xyz
Jul, 2024
探索两层线性神经网络中基于时期的双重下降现象
Towards understanding epoch-wise double descent in two-layer linear neural networks
HTML
PDF
Amanda Olmin, Fredrik Lindsten
TL;DR
对两层线性神经网络中的epoch-wise双下降现象进行研究,通过推导出标准线性回归模型的学习动力学和具有二次权重的线性两层对角网络之间的梯度流,识别了额外的导致epoch-wise双下降的因素,进而引出了对真正深度模型的未知因素的进一步问题。
Abstract
epoch-wise double descent
is the phenomenon where
generalisation performance
improves beyond the point of
overfitting
, resulting in a gene
→