BriefGPT.xyz
Oct, 2023
MCAD: 多教师跨模态对齐蒸馏以实现高效的图像-文本检索
MCAD: Multi-teacher Cross-modal Alignment Distillation for efficient image-text retrieval
HTML
PDF
Youbo Lei, Feifei He, Chen Chen, Yingbin Mo, Si Jia Li...
TL;DR
使用多教师跨模态对齐蒸馏技术 (MCAD),通过在双流模型中融合单流特征提高学生模型的检索性能,同时实现高效的图像-文本检索任务,降低模型大小和终端设备部署的复杂性。
Abstract
With the success of
large-scale visual-language pretraining models
and the wide application of
image-text retrieval
in industry areas, reducing the model size and streamlining their terminal-device deployment hav
→