BriefGPT.xyz
Mar, 2021
鲁棒音频视觉实例判别
Robust Audio-Visual Instance Discrimination
HTML
PDF
Pedro Morgado, Ishan Misra, Nuno Vasconcelos
TL;DR
本文介绍了一种自监督学习方法,以学习音频和视频表征,并通过行动识别任务的实验验证了其解决音频-视觉实例区别问题和提高迁移学习性能的贡献。
Abstract
We present a
self-supervised learning
method to learn audio and video representations. Prior work uses the natural correspondence between audio and video to define a standard cross-modal
instance discrimination
t
→