BriefGPT.xyz
Sep, 2024
GRIN: Zero-Shot Metric Depth with Pixel-Level Diffusion
Vitor Guizilini, Pavel Tokmakov, Achal Dave, Rares Ambrus
TL;DR
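This work addresses the scale ambiguity inherent in 3D reconstruction from a single image. It proposes GRIN, an efficient diffusion model that can handle sparse, unstructured training data. By combining image features with 3D geometric positional encodings during the diffusion process, the method sets a new state of the art in zero-shot metric monocular depth estimation across eight indoor and outdoor datasets, and it has significant potential for practical applications.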
Abstract
3D reconstruction from a single image is a long-standing problem in computer vision. Learning-based methods address its inherent scale ambiguity by leveraging increasingly large labeled and unlabeled datasets, to