BriefGPT.xyz
Dec, 2022
Bridging Images and Videos: A Simple Learning Framework for Large Vocabulary Video Object Detection
Sanghyun Woo, Kwanyong Park, Seoung Wug Oh, In So Kweon, Joon-Young Lee
TL;DR
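This paper proposes a new learning framework that combines the LVIS and TAO datasets to address the lack of supervision, enabling both detection and tracking in video recognition and improving the performance of large-vocabulary trackers on the TAO benchmark.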
Abstract
Scaling object taxonomies is one of the important steps toward a robust real-world deployment of recognition systems. We have faced remarkable progress in images since the introduction of the LVIS benchmark. To c