Jun, 2018
Theory IIIb: Generalization in Deep Networks
Tomaso Poggio, Qianli Liao, Brando Miranda, Andrzej Banburski, Xavier Boix...
TL;DR
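This paper studies the problem of overfitting in deep neural networks, proves the convergence and performance of neural networks trained with specific loss functions, and proposes a practical method for judging the generalization performance of different zero minimizers.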
Abstract
A main puzzle of deep neural networks (DNNs) revolves around the apparent absence of "overfitting", defined in this paper as follows: the expected error does not get worse when increasing the number of neurons or ...