Efficient 3D Object Reconstruction using Visual Transformers

Feb, 2023

Rohan Agarwal, Wei Zhou, Xiaofeng Wu, Yuhan Li

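TL;DR: The authors replace the convolutions in an existing efficient, high-performing 3D object reconstruction technique with visual transformers to predict 3D structure. The resulting models reach accuracy similar or superior to the baseline, suggesting that visual transformers have considerable potential for 3D object reconstruction.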

Abstract

Reconstructing a 3D object from a 2D image is a well-researched vision problem, with many kinds of deep learning techniques having been tried. Most commonly, 3D convolutional approaches are used, though previous work has shown state-of-the-art methods using →