BriefGPT.xyz
Feb, 2024
VideoMAC: 视频蒙版自动编码器与卷积神经网络相遇
VideoMAC: Video Masked Autoencoders Meet ConvNets
HTML
PDF
Gensheng Pei, Tao Chen, Xiruo Jiang, Huafeng Liu, Zeren Sun...
TL;DR
这篇论文介绍了一种名为VideoMAC的新方法,结合了对视频帧进行对称遮罩的视频自编码器和资源友好的ConvNets,以及一种称为MVM的简单而有效的遮罩视频建模方法,通过在下游任务中的表现超过了基于ViT的方法。
Abstract
Recently, the advancement of
self-supervised learning
techniques, like
masked autoencoders
(MAE), has greatly influenced visual representation learning for images and videos. Nevertheless, it is worth noting that
→