Describing a scene in terms of primitives -- geometrically simple shapes that
offer a parsimonious but accurate abstraction of structure -- is an established
vision problem. This is a good model of a difficult fitting problem: different
scenes require different numbers of primitives an
Blocks2World 是一种新颖的 3D 场景渲染和编辑方法,通过凸分解和条件合成的两步过程,从各种物体中提取 3D 平行四边形来获取场景的原始表示,进而通过简单的射线追踪深度图来生成配对数据,最后训练条件模型,学习从 2D 渲染的凸多边形到图像的直接映射,从而实现对新颖场景和编辑场景的出色控制和综合。