BriefGPT.xyz
Dec, 2019
视频动作识别的门移网络
Gate-Shift Networks for Video Action Recognition
HTML
PDF
Swathikiran Sudhakaran, Sergio Escalera, Oswald Lanz
TL;DR
本文中提出使用空间门控机制来处理3D核的空间-时间分解,实现Gate-Shift Module (GSM) 用于视频动作识别,结果在 Something Something-V1 和 Diving48 数据集上达到了最新的最优结果,而且在 EPIC-Kitchens 数据集上,获得了竞争性结果,具有远低于模型复杂度的优点。
Abstract
deep 3d cnns
for
video action recognition
are designed to learn powerful representations in the joint spatio-temporal feature space. In practice however, because of the large number of parameters and computations
→