BriefGPT.xyz
Dec, 2019
Linear Mode Connectivity and the Lottery Ticket Hypothesis
Jonathan Frankle, Gintare Karolina Dziugaite, Daniel M. Roy, Michael Carbin
TL;DR
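This paper studies whether neural network optimization reaches the same linearly connected minimum under different samples of SGD noise. It finds that standard vision models become stable to SGD noise early in training, and that iterative magnitude pruning (IMP) reaches full accuracy only once training is stable to SGD noise.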
Abstract
We introduce "instability analysis," a framework for assessing whether the outcome of optimizing a neural network is robust to SGD noise. It entails training two copies of a network on different random data order …
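The linear-connectivity check named in the TL;DR can be illustrated with a small sketch. The abstract is truncated on this page, so the barrier definition used here (highest loss on the straight line between two solutions, minus the mean loss at the two endpoints) is an assumption, and the names `interpolate`, `error_barrier`, `bowl`, and `double_well` are hypothetical. The two toy loss functions stand in for the test error of two trained copies of a network; no real training is performed.

```python
from typing import Callable, List, Sequence

Vector = Sequence[float]


def interpolate(w_a: Vector, w_b: Vector, alpha: float) -> List[float]:
    """Return (1 - alpha) * w_a + alpha * w_b, element by element."""
    return [(1.0 - alpha) * a + alpha * b for a, b in zip(w_a, w_b)]


def error_barrier(
    loss: Callable[[Vector], float],
    w_a: Vector,
    w_b: Vector,
    num_points: int = 31,
) -> float:
    """Highest loss on the line from w_a to w_b, minus the mean endpoint loss.

    Assumed definition: a value near zero suggests the two solutions are
    linearly connected; a large value suggests a barrier between them.
    """
    alphas = [i / (num_points - 1) for i in range(num_points)]
    path_losses = [loss(interpolate(w_a, w_b, a)) for a in alphas]
    endpoint_mean = 0.5 * (path_losses[0] + path_losses[-1])
    return max(path_losses) - endpoint_mean


# Hypothetical toy losses standing in for a network's test error.
def bowl(w: Vector) -> float:
    """A single convex basin: any two points in it are linearly connected."""
    return sum(x * x for x in w)


def double_well(w: Vector) -> float:
    """Two minima at w[0] = -1 and w[0] = +1, separated by a bump at w[0] = 0."""
    return (w[0] ** 2 - 1.0) ** 2 + w[1] ** 2


# Two "runs" that land in the same basin versus in different basins.
same_basin = error_barrier(bowl, [0.1, 0.0], [-0.1, 0.0])
different_basins = error_barrier(double_well, [-1.0, 0.0], [1.0, 0.0])

print(f"bowl barrier: {same_basin:.3f}")               # bowl barrier: 0.000
print(f"double-well barrier: {different_basins:.3f}")  # double-well barrier: 1.000
```

With real networks, the vectors would be the flattened weights of the two trained copies and `loss` would evaluate test error at the interpolated weights. That part is left out because it depends on the training framework.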