BriefGPT.xyz
Jun, 2017
深度学习泛化理解:损失景观的视角
Towards Understanding Generalization of Deep Learning: Perspective of Loss Landscapes
HTML
PDF
Lei Wu, Zhanxing Zhu, Weinan E
TL;DR
研究表明,深度神经网络模型具有很好的泛化能力,其优秀的泛化能力是来自于损失函数的加权局部最小值及其优化方法。
Abstract
It is widely observed that
deep learning
models with learned parameters generalize well, even with much more model parameters than the number of training samples. We systematically investigate the underlying reasons why deep
→