BriefGPT.xyz
Dec, 2022
双正对象识别:视觉原理的WHY提示
Doubly Right Object Recognition: A Why Prompt for Visual Rationales
HTML
PDF
Chengzhi Mao, Revant Teotia, Amrutha Sundar, Sachit Menon, Junfeng Yang...
TL;DR
本文提出了一种叫做“双重正确”的目标识别基准,要求视觉模型不仅可以提供正确的标签,同时也要提供正确的理由,结果显示,使用语言模型为视觉表征提供正确理由可以显著提高视觉模型性能。
Abstract
Many
visual recognition models
are evaluated only on their classification accuracy, a metric for which they obtain strong performance. In this paper, we investigate whether
computer vision
models can also provide
→