We present an approach that learns to synthesize high-quality novel views of 3D objects or scenes while providing fine-grained, precise control over the 6-DOF viewpoint. The approach is self-supervised and requires only 2D images and their associated view transforms for training. Our ma