Junbo Zhang, Guofan Fan, Guanghan Wang, Zhengyuan Su, Kaisheng Ma...
TL;DR通过文本场景描述信息辅助 3D 特征学习,进而提升三维语义场景理解的效果,并构建更好的语言与三维结构的多模态任务。
Abstract
Learning descriptive 3D features is crucial for understanding 3D scenes with diverse objects and complex structures. However, it is usually unknown whether important geometric attributes and scene context obtain enough emphasis in an end-to-end trained 3D scene understanding network. T