BriefGPT.xyz
Jun, 2023
跨模态知识蒸馏增强的多模态表征学习
Enhanced Multimodal Representation Learning with Cross-modal KD
HTML
PDF
Mengxi Chen, Linyu Xing, Yu Wang, Ya Zhang
TL;DR
本文介绍了一种基于多模态表示学习的学生-教师框架,并采用了对比学习和对抗学习等方法来优化相互信息和条件熵,以提高视频识别、检索和情感分类等多模态任务的性能。
Abstract
This paper explores the tasks of leveraging auxiliary modalities which are only available at training to enhance
multimodal representation learning
through
cross-modal knowledge distillation
(KD). The widely adop
→