BriefGPT.xyz
Apr, 2023
用类热噪声绘制神经网络景观的地形图
Charting the Topography of the Neural Network Landscape with Thermal-Like Noise
HTML
PDF
Theo Jules, Gal Brener, Tal Kachman, Noam Levi, Yohai Bar-Sinai
TL;DR
通过采用统计力学的方法,我们研究一个超参数全连接的神经网络分类任务的优化过程,发现该过程与热力学中的温度有类似的波动统计,确定了低误差区域为低维流形,且该维度由决策边界的附近数据点的数量控制,并解释了在高温下主要采样弯曲程度较大的地区的原因。
Abstract
The training of
neural networks
is a complex, high-dimensional, non-convex and noisy
optimization
problem whose theoretical understanding is interesting both from an applicative perspective and for fundamental re
→