BriefGPT.xyz
Feb, 2023
VoxFormer: Sparse Voxel Transformer for Camera-based 3D Semantic Scene Completion
Yiming Li, Zhiding Yu, Christopher Choy, Chaowei Xiao, Jose M. Alvarez...
TL;DR
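This paper proposes VoxFormer, a Transformer-based semantic scene completion framework that outputs complete 3D voxel semantics from 2D images, achieving relative improvements of 20% in geometry and 18.1% in semantics in testing.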
Abstract
Humans can easily imagine the complete 3D geometry of occluded objects and scenes. This appealing ability is vital for recognition and understanding. To enable such capability in AI systems, we propose VoxFormer, a Transformer-based
→