BriefGPT.xyz
Jul, 2019
在分布式异步环境下调控动量
Taming Momentum in a Distributed Asynchronous Environment
HTML
PDF
Ido Hakimi, Saar Barkai, Moshe Gabel, Assaf Schuster
TL;DR
提出了一种名为DANA的技术,旨在解决分布式异步训练的梯度走样问题,可以加速深度神经网络的训练过程且不影响最终精度。该方法在CIFAR和ImageNet数据集上表现出比现有方法更好的性能。
Abstract
Although
distributed computing
can significantly reduce the training time of
deep neural networks
, scaling the training process while maintaining high efficiency and final accuracy is challenging. Distributed
→