We propose an approach to predict the 3D shape and pose of the objects
present in a scene. Existing learning-based methods that pursue this goal make
independent predictions per object and do not leverage the relationships
amongst them. We argue that reasoning about these relationships is crucial, and
present an approach to incorporate these in a 3D predic