BriefGPT.xyz
Jun, 2022
视频模型中的独立帧间关注
Stand-Alone Inter-Frame Attention in Video Models
HTML
PDF
Fuchen Long, Zhaofan Qiu, Yingwei Pan, Ting Yao, Jiebo Luo...
TL;DR
本文提出一种名为SIFA的新型帧间注意力机制,能够有效地捕捉帧间形变信息,应用于ConvNets和Vision Transformer中成功构建SIFA-Net和SIFA-Transformer,并在多个视频数据集上进行实验,证明了SIFA-Net和SIFA-Transformer的有效性。
Abstract
motion
, as the uniqueness of a video, has been critical to the development of
video understanding
models. Modern deep learning models leverage
mo
→