Wenchen Han, Shay Vargaftik, Michael Mitzenmacher, Brad Karp, Ran Ben Basat
TL;DR梯度聚合、分布式机器学习、梯度压缩、评估方法和系统性能。
Abstract
gradient aggregation has long been identified as a major bottleneck in today's large-scale distributed machine learning training systems. One promising solution to mitigate such bottlenecks is →