BriefGPT.xyz
Mar, 2024
文本是MASS: 用于文本-视频检索的随机嵌入建模
Text Is MASS: Modeling as Stochastic Embedding for Text-Video Retrieval
HTML
PDF
Jiamian Wang, Guohao Sun, Pichao Wang, Dongfang Liu, Sohail Dianat...
TL;DR
该研究提出了一种新的文本建模方法T-MASS,通过将文本建模为随机嵌入,丰富了文本嵌入的语义范围,并在准确检索时利用了文本质量,从而在五个基准数据集上取得了最先进的性能。
Abstract
The increasing prevalence of video clips has sparked growing interest in
text-video retrieval
. Recent advances focus on establishing a joint
embedding
space for text and video, relying on consistent
→