BriefGPT.xyz
Jan, 2021
时空动作定位的活动图变换器
Activity Graph Transformer for Temporal Action Localization
HTML
PDF
Megha Nawhal, Greg Mori
TL;DR
该研究提出了一种基于深度学习的Activity Graph Transformer模型,可以对视频进行端到端分析,精确定位和识别视频内的特定事件活动,并通过非线性图推理方法捕获视频内事件之间的复杂时间结构。实验结果显示此方法在三个具有挑战性的数据集上均优于当前领先的方法。
Abstract
We introduce
activity graph transformer
, an end-to-end learnable model for
temporal action localization
, that receives a video as input and directly predicts a set of action instances that appear in the video. De
→