AbstractIn this paper, we argue that iterative computation with
Diffusion Models offers a powerful paradigm for not only generation but also visual perception tasks. We unify tasks such as depth estimation, optical flow, and amodal segmentation under the framework of
→