TL;DR本文提出了一种多模态空间矫正器解决自我中心图像中景深和表面法线预测的挑战,同时提出了一个新的数据集 EDINA 并使用多模态空间矫正器进行单视角深度和表面法线预测,在常见的自我中心图像数据集上优于基准模型。
Abstract
In this paper, we study a problem of egocentric scene understanding, i.e.,
predicting depths and surface normals from an egocentric image. Egocentric
scene understanding poses unprecedented challenges: (1) due to large head
movements, the images are taken from non-canonical viewpoints