CVPRJun, 2022

2022 年 Ego4D PNR 时序定位挑战赛结构化视频令牌

TL;DRSViT method proposes StructureViT to improve temporal localization by utilizing object tokens and enforcing frame-clip consistency, achieving a strong performance of 0.656 absolute error on Point of No Return challenge test set.