BriefGPT.xyz
Oct, 2019
神经损失景观的局部几何的新兴特性
Emergent properties of the local geometry of neural loss landscapes
HTML
PDF
Stanislav Fort, Surya Ganguli
TL;DR
本文通过实验和理论研究了神经网络的波动,发现高维神经网络的损失函数曲面具有多方向高正曲率、梯度下降具有狭窄、随机位于此曲面中不同位置处的超平面理论能够解释背后的机理。
Abstract
The local geometry of high dimensional
neural network
loss landscapes can both challenge our cherished theoretical intuitions as well as dramatically impact the practical success of
neural network
training. Indee
→