BriefGPT.xyz
May, 2021
深度ResNet的过度参数化:零损失和平均场分析
Overparameterization of deep ResNet: zero loss and mean-field analysis
HTML
PDF
Zhiyan Ding, Shi Chen, Qin Li, Stephen Wright
TL;DR
研究无限深度和无限宽度下Residual神经网络中梯度下降和凸优化的等效性,得出当神经网络足够大时,ResNet的训练可以得到几乎没有误差的近似解决方案。
Abstract
Finding parameters in a deep neural network (NN) that fit training data is a nonconvex
optimization problem
, but a basic first-order optimization method (
gradient descent
) finds a global solution with perfect fit
→