TL;DR该研究使用基于 GAN latent space 的技术和生成式扩散模型,通过条件输入的两个 latent codes(空间内容掩码和扁平化样式嵌入)对其生成进行控制,从而实现图像的有效操控和转化。
Abstract
denoising diffusion models have shown remarkable capabilities in generating realistic, high-quality and diverse images. However, the extent of controllability and editability with diffusion models is underexplored relative to GANs. Inspired by techniques based on the latent space of GA