BriefGPT.xyz
May, 2019
深度神经嵌入技术在视频无监督学习中的应用
Unsupervised Learning from Video with Deep Neural Embeddings
HTML
PDF
Chengxu Zhuang, Alex Andonian, Daniel Yamins
TL;DR
本文介绍了Video Instance Embedding(VIE)框架,它扩展了用于学习深度非线性嵌入的强大无监督损失函数以进行大规模视频数据集上的多流时间处理架构,展示了VIE训练的网络在Kinetics数据集的动作识别和ImageNet数据集的目标识别中有重大发展,并提供了分析表明路径如何有所不同。
Abstract
Because of the rich dynamical structure of videos and their ubiquity in everyday life, it is a natural idea that
video data
could serve as a powerful
unsupervised learning
signal for training visual representatio
→