TL;DR提出了一种名为Track Anything Model (TAM)的模型,它可以在视频中进行高效的交互式跟踪和分割,无需额外的训练,并在视频对象跟踪和分割方面表现出色。
Abstract
Recently, the segment anything model (sam) gains lots of attention rapidly due to its impressive segmentation performance on images. Regarding its strong ability on image segmentation and high interactivity with