BriefGPT.xyz
Sep, 2019
通过上下文边界感知预测,在视频中确定语言查询的时间基点
Temporally Grounding Language Queries in Videos by Contextual Boundary-aware Prediction
HTML
PDF
Jingwen Wang, Lin Ma, Wenhao Jiang
TL;DR
本文提出了一种基于Contextual Boundary-aware Prediction (CBP)的端到端模型来在视频中定位语句,并通过明确建模当前元素与其邻居之间的关系来聚合上下文信息,最终在三个公共数据集上表现显著优于现有的方法。
Abstract
The task of
temporally grounding
language queries in
videos
is to temporally localize the best matched video segment corresponding to a given language (sentence). It requires certain models to simultaneously perf
→