BriefGPT.xyz
Jun, 2023
深度网络中的显式和隐式正则化结合,实现高效学习
Combining Explicit and Implicit Regularization for Efficient Learning in Deep Networks
HTML
PDF
Dan Zhao
TL;DR
通过研究隐性正则化的梯度轨迹,借鉴深度线性网络梯度下降隐式正则化向低秩解的偏好性,并提出显式惩罚来模拟这种偏好,从而使单层网络可以达到深度线性网络相同的低秩逼近性能。
Abstract
Works on
implicit regularization
have studied gradient trajectories during the optimization process to explain why deep networks favor certain kinds of solutions over others. In
deep linear networks
, it has been
→