Adaptive Communication Strategies to Achieve the Best Error-Runtime Trade-off in Local-Update SGD

October 2018

Adaptive Communication Strategies to Achieve the Best Error-Runtime Trade-off in Local-Update SGD

Jianyu Wang, Gauri Joshi

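TL;DR: This paper introduces AdaComm, an adaptive communication strategy that trains deep neural networks faster, making large-scale machine learning training more robust and quicker to converge.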

Abstract

Large-scale machine learning training, in particular distributed stochastic gradient descent, needs to be robust to inherent system variability such as node straggling and random communication delays. This work considers a →