BriefGPT.xyz
Dec, 2024
面向开放词汇的视频语义分割
Towards Open-Vocabulary Video Semantic Segmentation
HTML
PDF
Xinhao Li, Yun Liu, Guolei Sun, Min Wu, Le Zhang...
TL;DR
本研究着眼于解决现有视频语义分割模型在面临不熟悉类别时的挑战,提出了开放词汇视频语义分割(OV-VSS)任务。我们提出了OV2VSS模型,利用时空融合模块和随机帧增强模块,提高了模型在各种开放词汇类别上的分割性能。实验结果表明,OV2VSS在处理新类别时具有零样本泛化能力,显著提升了视频语义分割任务的效果。
Abstract
Semantic Segmentation
in videos has been a focal point of recent research. However, existing models encounter challenges when faced with unfamiliar categories. To address this, we introduce the
Open Vocabulary
Vi
→