BriefGPT.xyz
Aug, 2023
Exploring Multi-Modal Contextual Knowledge for Open-Vocabulary Object Detection
Yifan Xu, Mengdan Zhang, Xiaoshan Yang, Changsheng Xu
TL;DR
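This paper explores the role of multi-modal contextual knowledge in open-vocabulary object detection and proposes a multi-modal contextual knowledge distillation framework. The framework learns contextual knowledge from a multi-modal fusion transformer and transfers it to a student detector, yielding significant improvements.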
Abstract
In this paper, we for the first time explore helpful multi-modal contextual knowledge to understand novel categories for open-vocabulary object detection…