BriefGPT.xyz
Aug, 2022
IVT: 一种端到端实例引导的视频Transformer用于3D姿态估计
IVT: An End-to-End Instance-guided Video Transformer for 3D Pose Estimation
HTML
PDF
Zhongwei Qiu, Qiansheng Yang, Jian Wang, Dongmei Fu
TL;DR
本文提出了一种基于实例引导视频变换器(IVT)的范式,该范式可以从视觉特征中有效地学习时空上下文深度信息,并直接从视频帧中预测3D姿势,实验结果显示该方法在三个广泛使用的3D姿势评估基准上取得了最先进的表现。
Abstract
Video
3d human pose estimation
aims to localize the 3D coordinates of human joints from videos. Recent
transformer-based approaches
focus on capturing the spatiotemporal information from sequential 2D poses, whic
→