BriefGPT.xyz
Oct, 2017
SGD Learns Over-parameterized Networks that Provably Generalize on Linearly Separable Data
Alon Brutzkus, Amir Globerson, Eran Malach, Shai Shalev-Shwartz
TL;DR
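For two-layer over-parameterized neural networks with Leaky ReLU activations trained on linearly separable data, this work gives optimization and generalization guarantees for SGD, with generalization guarantees that are independent of network size.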
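The setting in the TL;DR can be illustrated with a small experiment. The following is a minimal sketch, not the authors' implementation: it assumes a hinge loss, a fixed second layer with equal numbers of positive and negative weights, and 2-D synthetic data separated by the line x1 + x2 = 0. All function names and hyperparameters (`k`, `alpha`, `lr`, `margin`) are illustrative choices, not values taken from the paper.

```python
import random


def leaky_relu(z, alpha):
    """Leaky ReLU: identity for positive inputs, slope alpha otherwise."""
    return z if z > 0 else alpha * z


def make_separable_data(n, margin, rng):
    """Sample points in [-1, 1]^2 labeled by the sign of x1 + x2.

    Points closer than `margin` to the separating line are rejected,
    so the data is linearly separable with a margin (an assumption of this sketch).
    """
    data = []
    while len(data) < n:
        x = (rng.uniform(-1, 1), rng.uniform(-1, 1))
        s = (x[0] + x[1]) / 2 ** 0.5  # signed distance to the line x1 + x2 = 0
        if abs(s) >= margin:
            data.append((x, 1 if s > 0 else -1))
    return data


def predict(W, v, x, alpha):
    """Two-layer network output: sum_i v_i * leaky_relu(w_i . x)."""
    return sum(vi * leaky_relu(w[0] * x[0] + w[1] * x[1], alpha)
               for w, vi in zip(W, v))


def train_sgd(data, k=10, alpha=0.2, lr=0.5, max_epochs=500, seed=0):
    """Train only the first layer with SGD on the hinge loss max(0, 1 - y f(x))."""
    rng = random.Random(seed)
    # 2k hidden units with small random first-layer weights.
    W = [[rng.gauss(0, 0.01), rng.gauss(0, 0.01)] for _ in range(2 * k)]
    # Second layer is fixed (assumption): k positive and k negative weights.
    scale = 1 / (2 * k) ** 0.5
    v = [scale] * k + [-scale] * k
    order = list(data)
    for _ in range(max_epochs):
        rng.shuffle(order)
        updates = 0
        for x, y in order:
            if y * predict(W, v, x, alpha) < 1:  # hinge loss is active
                for w, vi in zip(W, v):
                    pre = w[0] * x[0] + w[1] * x[1]
                    slope = 1.0 if pre > 0 else alpha
                    step = lr * y * vi * slope  # negative hinge gradient, scaled by lr
                    w[0] += step * x[0]
                    w[1] += step * x[1]
                updates += 1
        if updates == 0:  # zero hinge loss on every training point
            break
    return W, v


def accuracy(W, v, data, alpha):
    """Fraction of points whose label matches the sign of the network output."""
    return sum(1 for x, y in data if y * predict(W, v, x, alpha) > 0) / len(data)


data = make_separable_data(100, margin=0.3, rng=random.Random(1))
W, v = train_sgd(data)
acc = accuracy(W, v, data, alpha=0.2)
print(f"hidden units: {len(W)}, training accuracy: {acc:.2f}")
```

Changing `k` in `train_sgd` is one way to probe, informally, how the behavior varies with network width in this toy setup; it is not a substitute for the paper's formal guarantees.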
Abstract
Neural networks exhibit good generalization behavior in the over-parameterized regime, where the number of network parameters exceeds the …