BriefGPT.xyz
Feb, 2019
深度网络极小值的尺度不变平坦度量
A Scale Invariant Flatness Measure for Deep Network Minima
HTML
PDF
Akshay Rangamani, Nam H. Nguyen, Abhishek Kumar, Dzung Phan, Sang H. Chin...
TL;DR
通过提出基于海森矩阵的浅度测量,在深度网络训练中检验了大批量SGD最小值确实比小批量SGD最小值更锐利,并且我们证明了正同态激活的深度网络的等价关系在参数空间中的商流形结构,并提出了一种具有等价不变性的测量平坦度的方法。
Abstract
It has been empirically observed that the flatness of minima obtained from training
deep networks
seems to correlate with better
generalization
. However, for
→