BriefGPT.xyz
Feb, 2020
Decentralized gradient methods: does topology matter?
Giovanni Neglia, Chuan Xu, Don Towsley, Gianmarco Calbi
TL;DR
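This paper studies how the communication topology among workers affects the convergence speed of distributed optimization methods, and proposes using sparse topologies to improve convergence speed.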
Abstract
Consensus-based distributed optimization methods have recently been advocated as alternatives to parameter server and ring all-reduce paradigms for large scale training of machine learning models. In this case, e