BriefGPT.xyz
Dec, 2020
同质神经网络适应性优化算法的隐含偏差
The Implicit Bias for Adaptive Optimization Algorithms on Homogeneous Neural Networks
HTML
PDF
Bohan Wang, Qi Meng, Wei Chen
TL;DR
研究表明采用指数移动平均策略的自适应算法如Adam和RMSProp可以最大化神经网络的边界,而直接在条件器中加历史平方梯度的AdaGrad却不行。
Abstract
Despite their overwhelming capacity to overfit,
deep neural networks
trained by specific
optimization algorithms
tend to generalize relatively well to unseen data. Recently, researchers explained it by investigat
→