BriefGPT.xyz
Jun, 2020
Entropic gradient descent algorithms and wide flat minima
Fabrizio Pittorino, Carlo Lucibello, Christoph Feinauer, Enrico M. Malatesta, Gabriele Perugini...
TL;DR
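The paper discusses the properties of flat minima in the empirical risk landscape of neural networks and proposes algorithms that increase flatness, which yield better classification performance.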
Abstract
The properties of flat minima in the empirical risk landscape of neural networks have been debated for some time. Increasing evidence suggests they possess better...