Nov, 2023
FlashFFTConv: Efficient Convolutions for Long Sequences with Tensor Cores
Daniel Y. Fu, Hermann Kumbong, Eric Nguyen, Christopher Ré
TL;DR
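We propose FlashFFTConv, a method for optimizing FFT convolutions. It uses a matrix decomposition structure and sparse convolution algorithms to speed up long-sequence tasks, and it improves model quality and accuracy under a fixed compute budget.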
Abstract
Convolution models with long filters have demonstrated state-of-the-art reasoning abilities in many long-sequence tasks but lag behind the most optimized Transformers in wall-clock time. A major bottleneck is the fast f