TL;DR提出一种高效的多视角立体(MVS)网络以实现多视角图像的三维重建和深度推断,其采用策略推断出粗到细的深度图,其中引入自注意力层与相似度测量来生成新的代价体以进行深度图细化,最终实验表明该模型优于大多数 SOTA 方法。
Abstract
We present an efficient multi-view stereo (MVS) network for 3d reconstruction from multiview images. While previous learning based reconstruction approaches performed quite well, most of them estimate depth maps at a fixed resolution using plane sweep volumes with a fixed depth hypothe