BriefGPT.xyz
Oct, 2016
QSGD: 通过梯度量化和编码实现通信高效的SGD
QSGD: Randomized Quantization for Communication-Optimal Stochastic Gradient Descent
HTML
PDF
Dan Alistarh, Jerry Li, Ryota Tomioka, Milan Vojnovic
TL;DR
提出了一种名为Quantized SGD的压缩梯度下降的算法,使用该算法可以在降低通信代价的同时保证收敛,且在图像分类和自动语音识别等多个实验中表现优异。
Abstract
Parallel implementations of
stochastic gradient descent
(SGD) have received significant research attention, thanks to excellent scalability properties of this algorithm, and to its efficiency in the context of training
→