TL;DR该研究提出 Maximum Concept Matching(MCM)这一零样本的多模态算法来识别异常数据,利用视觉特征与文本概念进行对齐。研究发现 MCM 比单模态算法在效果上更为优秀,特别是结合视觉-语言特征时。
Abstract
Recognizing out-of-distribution (OOD) samples is critical for machine learning systems deployed in the open world. The vast majority of OOD detection methods are driven by a single modality (e.g., either vision or language), leaving the rich information in multi-modal representations u