BriefGPT.xyz
Nov, 2023
跨1000帧的10亿参数端到端时序动作检测
End-to-End Temporal Action Detection with 1B Parameters Across 1000 Frames
HTML
PDF
Shuming Liu, Chen-Lin Zhang, Chen Zhao, Bernard Ghanem
TL;DR
通过降低训练内存消耗,本研究提出了一种新颖的轻量级模块——时间信息适配器(TIA),有效地增加了时间动作检测(TAD)系统的规模和输入视频的帧数,从而显著提高了检测性能。
Abstract
Recently,
temporal action detection
(TAD) has seen significant performance improvement with
end-to-end training
. However, due to the memory bottleneck, only models with limited scales and limited data volumes can
→