BriefGPT.xyz
Jun, 2020
Exploiting Visual Semantic Reasoning for Video-Text Retrieval
Zerun Feng, Zhimin Zeng, Caili Guo, Zheng Li
TL;DR
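To improve video retrieval performance, we propose a Visual Semantic Enhanced Reasoning Network (ViSERN). The network uses graph convolutional networks that follow a random-walk rule to generate region features involving semantic relations, then aggregates these region features into frame-level features used to measure the similarity between video and text.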
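The reasoning step summarized above can be illustrated with a minimal sketch. It is not the authors' implementation: the dot-product affinity, the softmax normalization that makes each row a random-walk transition distribution, the single layer, the mean pooling, and the names `random_walk_gcn_layer`, `frame_feature`, and `weight` are all assumptions chosen for illustration.

```python
import math


def random_walk_gcn_layer(regions, weight):
    """One graph-convolution step over a fully connected region graph.

    regions: list of n region feature vectors (each a list of d floats)
    weight:  d x d_out matrix as a list of rows (hypothetical learned parameter)
    """
    n = len(regions)
    d = len(regions[0])
    d_out = len(weight[0])

    # Pairwise affinity between regions: plain dot product (an assumption).
    scores = [
        [sum(a * b for a, b in zip(regions[i], regions[j])) for j in range(n)]
        for i in range(n)
    ]

    # Row-wise softmax: each row sums to 1, so it can be read as the
    # transition probabilities of a random walk leaving that region.
    transition = []
    for row in scores:
        peak = max(row)
        exps = [math.exp(s - peak) for s in row]
        total = sum(exps)
        transition.append([e / total for e in exps])

    # Propagate: each region becomes a probability-weighted mix of all regions.
    mixed = [
        [sum(transition[i][j] * regions[j][k] for j in range(n)) for k in range(d)]
        for i in range(n)
    ]

    # Linear projection followed by ReLU.
    return [
        [max(0.0, sum(mixed[i][k] * weight[k][c] for k in range(d))) for c in range(d_out)]
        for i in range(n)
    ]


def frame_feature(region_features):
    """Mean-pool region features into a single frame-level vector."""
    n = len(region_features)
    return [sum(column) / n for column in zip(*region_features)]


# Toy usage: three regions with 2-dimensional features and an identity projection.
regions = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
identity = [[1.0, 0.0], [0.0, 1.0]]
frame = frame_feature(random_walk_gcn_layer(regions, identity))
```

In this sketch each region's output mixes information from every other region in proportion to their affinity, which is one simple way to let semantically related regions reinforce each other before pooling to a frame-level vector.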
Abstract
Video retrieval is a challenging research topic bridging the vision and language areas and has attracted broad attention in recent years. Previous works have been devoted to representing videos by directly encoding from