BriefGPT.xyz
May, 2024
OpenDAS: 开放词汇切分的领域适应
OpenDAS: Domain Adaptation for Open-Vocabulary Segmentation
HTML
PDF
Gonca Yilmaz, Songyou Peng, Francis Engelmann, Marc Pollefeys, Hermann Blum
TL;DR
我们提出了一种基于视觉语言模型的领域自适应方法,通过结合参数高效的提示微调和三元组损失训练策略,提高了开放词汇的普适性,并适应了视觉领域,改善了开放词汇分割任务中的性能。
Abstract
The advent of
vision language models
(VLMs) transformed image understanding from closed-set classifications to dynamic image-language interactions, enabling
open-vocabulary segmentation
. Despite this flexibility,
→