BriefGPT.xyz
Aug, 2023
DiffusionVMR:视频时刻检索的扩散模型
DiffusionVMR: Diffusion Model for Video Moment Retrieval
HTML
PDF
Henghao Zhao, Kevin Qinghong Lin, Rui Yan, Zechao Li
TL;DR
该研究提出了一种名为DiffusionVMR的提议无关框架,通过将视频时刻检索重新构想为去噪生成过程,直接从噪声中采样随机时段作为候选,并引入去噪学习以确定目标时刻。实验证明DiffusionVMR相比现有方法具有更高的效果。
Abstract
video moment retrieval
is a fundamental visual-language task that aims to retrieve
target moments
from an untrimmed video based on a language query. Existing methods typically generate numerous proposals manually
→