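TL;DR: This paper proposes a tracklet proposal method built on the video object tracklet detection pipeline MEGA and deepSORT and applies it to VidVRD. It designs a tracklet-based visual Transformer with a temporal-aware decoder that predicts the final relations. Experimental results show its superiority on Video Relation Understanding.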
Abstract
Video visual relation detection (VidVRD) has received significant attention from our community in recent years. In this paper, we apply the state-of-the-art video object tracklet detection pipeline MEGA and