BriefGPT.xyz
Aug, 2023
PAT: 基于位置感知的稠密多标签动作检测的Transformer
PAT: Position-Aware Transformer for Dense Multi-Label Action Detection
HTML
PDF
Faegheh Sardari, Armin Mustafa, Philip J. B. Jackson, Adrian Hilton
TL;DR
我们提出了PAT,一种基于Transformer的网络,通过利用多尺度时间特征来学习视频中复杂的时间共现动作依赖关系。
Abstract
We present
pat
, a
transformer-based network
that learns complex
temporal co-occurrence action dependencies
in a video by exploiting
→