TL;DR提出了第一个开放式视频实例分割(Open-World Video Instance Segmentation, OW-VIS)方法——OW-VISFormer,它引入了一个新的特征增强机制和一个时空客体性(Spatio-Temporal Objectness, STO)模块,并评估了其在开放式实验室下的特性。
Abstract
Existing video instance segmentation (VIS) approaches generally follow a closed-world assumption, where only seen category instances are identified and spatio-temporally segmented at inference. open-world formulation