BriefGPT.xyz
Jun, 2024
图管道:通过图管道并行提高DNN训练的性能和可扩展性
GraphPipe: Improving Performance and Scalability of DNN Training with Graph Pipeline Parallelism
HTML
PDF
Byungsoo Jeon, Mengdi Wu, Shiyi Cao, Sunghyun Kim, Sunghyun Park...
TL;DR
深度神经网络的管道并行化方法(GPP)以及分布式系统GraphPipe通过优化微批量进度和并行训练实现了对现有管道并行系统如PipeDream和Piper的提速和搜索时间的降低。
Abstract
deep neural networks
(DNNs) continue to grow rapidly in size, making them infeasible to train on a single device.
pipeline parallelism
is commonly used in existing DNN systems to support large-scale
→