BriefGPT.xyz
Mar, 2019
深度网络中的动力学和泛化理论 III
Theory III: Dynamics and Generalization in Deep Networks
HTML
PDF
Andrzej Banburski, Qianli Liao, Brando Miranda, Lorenzo Rosasco, Bob Liang...
TL;DR
本研究通过分析深度神经网络的梯度下降技术实现,提出了控制网络复杂度的隐含规范化方法,并将其归纳为梯度下降算法的内在偏差,说明这种方法可以解决深度学习中过拟合的问题。
Abstract
We review recent observations on the dynamical systems induced by
gradient descent
methods used for training
deep networks
and summarize properties of the solutions they converge to. Recent results illuminate the
→