BriefGPT.xyz
Oct, 2019
深度残差网络过参数化情况下的算法依赖性泛化界
Algorithm-Dependent Generalization Bounds for Overparameterized Deep Residual Networks
HTML
PDF
Spencer Frei, Yuan Cao, Quanquan Gu
TL;DR
通过分析过度参数化的深层残差网络,我们证明了梯度下降学习的网络类是整个神经网络函数类的一个子集,这个子集足够大以保证小的训练误差和测试误差,并且此类网络具有小的泛化差距,这提供了残差网络优于非残差网络的解释。
Abstract
The
skip-connections
used in
residual networks
have become a standard architecture choice in
deep learning
due to the increased training s
→