BriefGPT.xyz
May, 2024
长文本生成AI的统一序列并行算法
A Unified Sequence Parallelism Approach for Long Context Generative AI
HTML
PDF
Jiarui Fang, Shangchun Zhao
TL;DR
通过比较序列并行性的通信和内存成本,本文提出了一种统一的序列并行性方法,适用于Transformer模型架构和网络硬件拓扑,实现了对长序列的生成AI模型的更好性能。
Abstract
sequence parallelism
(SP), which divides the sequence dimension of input tensors across multiple computational devices, is becoming key to unlocking the long-context capabilities of
generative ai models
. This pap
→