BriefGPT.xyz
Dec, 2023
FlashVideo:快速从文本生成视频的框架
FlashVideo: A Framework for Swift Inference in Text-to-Video Generation
HTML
PDF
Bin Lei, le Chen, Caiwen Ding
TL;DR
FlashVideo是一种新颖框架,通过使用RetNet架构,将序列长度为L的推理时间复杂度从O(L^2)降低到O(L),从而显著加快推理速度,并且通过抛弃冗余帧插值方法来增强帧插值的效率,实现了相对传统自回归转换模型的9.17倍效率提升,并且推理速度与基于BERT的转换模型相当。
Abstract
In the evolving field of
machine learning
,
video generation
has witnessed significant advancements with
autoregressive-based transformer models
→