BriefGPT.xyz
Sep, 2020
神经网络训练的最小作用量原理
A Principle of Least Action for the Training of Neural Networks
HTML
PDF
Skander Karkar, Ibrahhim Ayed, Emmanuel de Bézenac, Patrick Gallinari
TL;DR
通过将神经网络视为一种随时间推移的动力系统,我们发现网络的输运映射中存在低动能位移偏差,并将其与泛化性能相关联,从而提出了一种新的学习算法,该算法可自动适应给定任务的复杂度,并在低数据情况下产生具有高泛化能力的网络。
Abstract
neural networks
have been achieving high
generalization performance
on many tasks despite being highly over-parameterized. Since classical statistical learning theory struggles to explain this behavior, much effo
→