Mar, 2024

IllusionVQA:一个为视觉语言模型设计的具有挑战性的视错觉数据集

TL;DRVision Language Models are tested on the IllusionVQA dataset, revealing their performance and weaknesses in comprehension and soft localization tasks, particularly in the context of optical illusions and In-Context Learning.