BriefGPT.xyz
Nov, 2020
SEA: 用文本查询进行视频检索的句子编码器组合
SEA: Sentence Encoder Assembly for Video Retrieval by Textual Queries
HTML
PDF
Xirong Li, Fangming Zhou, Chaoxi Xu, Jiaqi Ji, Gang Yang
TL;DR
本研究提出了一种名为 Sentence Encoder Assembly 的新方法,通过多空间多损失学习实现语句编码器的有效利用和文本-视频匹配,并在四个基准测试中表现出优于当前最先进技术的性能。
Abstract
Retrieving unlabeled videos by textual queries, known as
ad-hoc video search
(AVS), is a core theme in multimedia data management and retrieval. The success of AVS counts on
cross-modal representation learning
th
→