BriefGPT.xyz
May, 2024
PT43D: A Probabilistic Transformer for Generating 3D Shapes from Single Highly-Ambiguous RGB Images
Yiheng Xiong, Angela Dai
TL;DR
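Proposes a Transformer-based autoregressive model that generates a probabilistic distribution of 3D shapes conditioned on an RGB image that may contain a highly ambiguous observation of the object. The model uses a cross-attention mechanism to identify the regions of interest most relevant to shape generation, and it outperforms existing methods on both synthetic and real data.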
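The cross-attention idea mentioned above can be illustrated with a minimal sketch. This is generic scaled dot-product attention written in plain Python, not the authors' implementation; the names `cross_attention`, `queries`, `keys`, and `values`, and the toy numbers, are assumptions chosen for illustration. Queries stand in for shape tokens and keys/values stand in for image-region features.

```python
import math

def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def cross_attention(queries, keys, values):
    """Scaled dot-product cross-attention (illustrative, hypothetical names).

    queries: one vector per shape token
    keys, values: one vector per image region
    Returns (outputs, weights), where weights[i][j] is how strongly
    shape token i attends to image region j.
    """
    d = len(keys[0])
    outputs, weights = [], []
    for q in queries:
        # Similarity between this query and every image-region key.
        scores = [sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d) for k in keys]
        w = softmax(scores)
        # Weighted sum of the region values, one output channel at a time.
        out = [sum(wj * v[c] for wj, v in zip(w, values)) for c in range(len(values[0]))]
        weights.append(w)
        outputs.append(out)
    return outputs, weights

# Toy input: one shape-token query and three image regions.
queries = [[1.0, 0.0]]
keys = [[4.0, 0.0], [0.0, 4.0], [-4.0, 0.0]]
values = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
outputs, weights = cross_attention(queries, keys, values)
```

In this toy setup the first region's key is the one most aligned with the query, so it receives the largest attention weight. In a real model the queries, keys, and values would be learned projections of token and image features.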
Abstract
Generating 3D shapes from single RGB images is essential in various applications such as robotics. Current approaches typically target images containing clear and complete visual descriptions of the object, witho