BriefGPT.xyz
May, 2023
单隐藏层神经网络梯度流性质与线性激活函数的研究
On the ISS Property of the Gradient Flow for Single Hidden-Layer Neural Networks with Linear Activations
HTML
PDF
Arthur Castello B. de Oliveira, Milad Siami, Eduardo D. Sontag
TL;DR
通过研究神经网络的超参数化和过拟合对梯度下降算法鲁棒性的影响,我们证明了过度参数化会引入伪平衡点,阻碍梯度下降算法的收敛。
Abstract
Recent research in
neural networks
and
machine learning
suggests that using many more parameters than strictly required by the initial complexity of a regression problem can result in more accurate or faster-conv
→