BriefGPT.xyz
Dec, 2014
用随机游走初始化训练超深度前馈网络
Random Walks: Training Very Deep Nonlinear Feed-Forward Networks with Smart Initialization
HTML
PDF
David Sussillo
TL;DR
该研究论文探讨了在机器学习中训练深度网络的困难之处,并提出了一种方法解决梯度消失问题,即适当增加各层的宽度以缓解问题。
Abstract
Training very
deep networks
is an important open problem in machine learning. One of many difficulties is that the norm of the back-propagated
gradient
can grow or decay exponentially. Here we show that training
→