BriefGPT.xyz
Apr, 2024
利用大型语言模型演变可解释的视觉分类器
Evolving Interpretable Visual Classifiers with Large Language Models
HTML
PDF
Mia Chiquier, Utkarsh Mall, Carl Vondrick
TL;DR
通过演化搜索算法和大语言模型的上下文学习能力,我们提出了一种能够发现解释性又具有辨识性的用于视觉识别的属性集合的新方法,并在五个细粒度的iNaturalist数据集上比最先进的基准方法提高了18.4%,在两个KikiBouba数据集上提高了22.2%。
Abstract
multimodal pre-trained models
, such as CLIP, are popular for
zero-shot classification
due to their open-vocabulary flexibility and high performance. However,
→