BriefGPT.xyz
Feb, 2024
神经排序崩溃:权重衰减和小的内类变异性带来低秩偏差
Neural Rank Collapse: Weight Decay and Small Within-Class Variability Yield Low-Rank Bias
HTML
PDF
Emanuele Zangrando, Piero Deidda, Simone Brugiapaglia, Nicola Guglielmi, Francesco Tudisco
TL;DR
深度学习中的低秩偏好与神经网络的神经层塌陷现象相关,权重衰减参数的增长导致网络中每一层的秩与前一层隐藏空间嵌入的类内变异成正比减少。
Abstract
Recent work in
deep learning
has shown strong empirical and theoretical evidence of an implicit
low-rank bias
: weight matrices in deep networks tend to be approximately low-rank and removing relatively small sing
→