Jan, 2018
Cross-modal Embeddings for Video and Audio Retrieval
Didac Surís, Amanda Duarte, Amaia Salvador, Jordi Torres, Xavier Giró-i-Nieto
TL;DR
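This paper presents a method that uses the correspondence between the visual and audio content of videos in the YouTube-8M dataset to train deep neural networks without manual labels. The result is an unsupervised approach to cross-modal feature learning that achieves good retrieval results.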
Abstract
The increasing amount of online videos brings several opportunities for training self-supervised neural networks. The creation of large scale datasets of videos such as the YouTube-8M allows us to deal with this …