Ning Xu, Linjie Yang, Yuchen Fan, Jianchao Yang, Dingcheng Yue...
TL;DR本文介绍了一个基于大规模数据集的序列-序列网络,能够充分利用视频的长期时空信息进行分割,在 YouTube-VOS 测试集上取得了最佳结果,在 DAVIS 2016 上与现有最先进方法相比也有可比性。
Abstract
Learning long-term spatial-temporal features are critical for many video analysis tasks. However, existing video segmentation methods predominantly rely on static image segmentation techniques, and methods captur