BriefGPT.xyz
Dec, 2023
PipeOptim: 保证具有依赖优化器的权重预测的有效1F1B调度
PipeOptim: Ensuring Effective 1F1B Schedule with Optimizer-Dependent Weight Prediction
HTML
PDF
Lei Guan, Dongsheng Li, Jiye Liang, Wenjian Wang, Xicheng Lu
TL;DR
本文提出了一种基于优化器相关的权重预测策略(称为PipeOptim),用于解决异步管道训练中的权重一致性和陈旧性问题,以确保高吞吐量和有效的参数学习,实验证明PipeOptim优于其他流行的管道方法。
Abstract
asynchronous pipeline model parallelism
with a "1F1B" (one forward, one backward) schedule generates little bubble overhead and always provides quite a high throughput. However, the "1F1B" schedule inevitably leads to
w
→