BriefGPT.xyz
Oct, 2018
Adaptive Communication Strategies to Achieve the Best Error-Runtime Trade-off in Local-Update SGD
Jianyu Wang, Gauri Joshi
TL;DR
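This paper introduces AdaComm, an adaptive communication strategy for local-update SGD that trains deep neural networks faster, making large-scale machine learning training more robust to system variability while converging more quickly.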
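To make the idea of local-update SGD with a changing communication period concrete, here is a minimal sketch on a toy one-dimensional problem. Everything in it is an illustrative assumption: the function name `local_update_sgd`, the quadratic objective, the noise level, and in particular the "halve the period every 5 rounds" schedule are hypothetical and are not the rule used by AdaComm in the paper.

```python
import random


def local_update_sgd(num_workers=4, rounds=20, tau0=8, lr=0.1, seed=0):
    """Toy local-update SGD on f(x) = 0.5 * (x - 3)^2 with noisy gradients.

    Each worker runs `tau` local SGD steps, then all workers average their
    models (the communication step). `tau` shrinks over time under a
    hypothetical schedule chosen only for illustration.
    """
    rng = random.Random(seed)
    target = 3.0      # minimizer of the toy objective
    x_global = 0.0    # shared starting point
    tau = tau0        # current communication period (local steps per round)
    history = []

    for r in range(rounds):
        local_models = []
        for _ in range(num_workers):
            x = x_global  # each worker starts from the last averaged model
            for _ in range(tau):
                grad = (x - target) + rng.gauss(0.0, 0.5)  # noisy gradient
                x -= lr * grad
            local_models.append(x)

        # Communication step: average the local models.
        x_global = sum(local_models) / num_workers
        history.append((tau, x_global))

        # Assumed schedule: halve tau every 5 rounds, never below 1.
        if (r + 1) % 5 == 0:
            tau = max(1, tau // 2)

    return x_global, history


if __name__ == "__main__":
    x_final, history = local_update_sgd()
    print("final model:", x_final)
    print("communication periods:", [t for t, _ in history])
```

A large period means fewer averaging steps per unit of computation, which saves communication time; a small period keeps the workers' models closer together. The sketch only illustrates that trade-off and makes no claim about how the paper chooses the period.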
Abstract
Large-scale machine learning training, in particular distributed stochastic gradient descent, needs to be robust to inherent system variability such as node straggling and random communication delays. This work considers a