BriefGPT.xyz
Aug, 2022
一种特征空间多模态数据增强技术用于文本-视频检索
A Feature-space Multimodal Data Augmentation Technique for Text-video Retrieval
HTML
PDF
Alex Falcon, Giuseppe Serra, Oswald Lanz
TL;DR
本文介绍了利用文本-视频检索方法,并结合数据增强技术及多模态数据的方法,对大规模公共数据集EPIC-Kitchens-100的测试性能进行提升,灵敏的处理方式能以自然语言查询进行相关视频的查找。
Abstract
Every hour, huge amounts of visual contents are posted on social media and user-generated content platforms. To find relevant videos by means of a
natural language query
,
text-video retrieval
methods have receive
→