CLIP 对纹理的理解能力如何？

Mar, 2022

Leveraging Textures in Zero-shot Understanding of Fine-Grained Domains

Chenyun Wu, Subhransu Maji

TL;DR本研究探讨了CLIP在自然语言描述的自然图像中对纹理的理解能力。我们分析了CLIP在各种纹理和材质分类数据集上的零样本学习表现，分析了它对DTDD数据集上红点或黄色条纹等纹理组成特性的表达能力，以及对通过描述鸟身体部位的颜色和纹理来进行细粒度分类的帮助。

Abstract

Textures can be used to describe the appearance of objects in a wide range of fine-grained domains. Textures are localized and one can often refer to their properties in a manner that is independent of the object identity. Moreover, there is a rich vocabulary to describe textures corresponding to properties such as their color, pattern, structure, periodicit