BriefGPT.xyz
Jun, 2023
Beyond Implicit Bias: The Insignificance of SGD Noise in Online Learning
Nikhil Vyas, Depen Morwani, Rosie Zhao, Gal Kaplun, Sham Kakade...
TL;DR
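Through extensive empirical analysis of image and language data, we show that in online learning, large learning rates and small batch sizes confer no implicit-bias advantage on SGD.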
Abstract
The success of SGD in deep learning has been ascribed by prior works to the implicit bias induced by high learning rate or small batch size ("SGD …