Scan2Cap：RGB-D扫描中基于上下文的密集字幕生成

Dec, 2020

Scan2Cap：RGB-D扫描中基于上下文的密集字幕生成

Scan2Cap: Context-aware Dense Captioning in RGB-D Scans

Dave Zhenyu Chen, Ali Gholami, Matthias Nießner, Angel X. Chang

TL;DR本文介绍了使用Scan2Cap方法对3D扫描中的物体进行检测和描述，在生成的描述中使用注意力机制和消息传递图模块，取得了显著的性能提升。

Abstract

We introduce the task of dense captioning in 3d scans from commodity RGB-D sensors. As input, we assume a point cloud of a 3D scene; the expected output is the bounding boxes along with the descriptions for the u