BriefGPT.xyz
Jan, 2024
运动引导的令牌压缩用于高效的掩码视频建模
Motion Guided Token Compression for Efficient Masked Video Modeling
HTML
PDF
Yukun Feng, Yangming Shi, Fengze Liu, Tan Yan
TL;DR
通过提高FPS速率并使用MGTC方法,在视频理解方面取得了显著的性能提升,并在降低计算负担的同时保持了高的性能表现。
Abstract
Recent developments in
transformers
have achieved notable strides in enhancing
video comprehension
. Nonetheless, the O($N^2$) computation complexity associated with attention mechanisms presents substantial compu
→