BriefGPT.xyz
Dec, 2021
ScanQA: 三维问题回答用于空间场景理解
ScanQA: 3D Question Answering for Spatial Scene Understanding
HTML
PDF
Daichi Azuma, Taiki Miyanishi, Shuhei Kurita, Motoki Kawanabe
TL;DR
通过学习语言表达与三维场景的地理特征相关的学习描述符,我们提出了一种基线模型(ScanQA),用于在三维环境中执行基于对象的问题回答,并构建了一个新的ScanQA数据集,其中包含来自800个室内场景的40,000个问题答案对。
Abstract
We propose a new
3d spatial understanding
task of 3D Question Answering (
3d-qa
). In the
3d-qa
task, models receive visual information from
→