BriefGPT.xyz
Feb, 2023
Video-Text Retrieval by Supervised Multi-Space Multi-Grained Alignment
Yimu Wang, Peng Shi
TL;DR
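This work proposes SUMA, a new supervised multi-space, multi-grained learning framework for learning an aligned representation space between video and text, in which the initial alignment space is initialized with a certain number of concept clusters. Experimental results show that SUMA outperforms existing methods.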
Abstract
While recent progress in video-text retrieval has been advanced by the exploration of better representation learning, in this paper, we present a novel