BriefGPT.xyz
Dec, 2021
使用潜在扩散模型进行高分辨率图像合成
High-Resolution Image Synthesis with Latent Diffusion Models
HTML
PDF
Robin Rombach, Andreas Blattmann, Dominik Lorenz, Patrick Esser, Björn Ommer
TL;DR
通过在预训练的自编码器的潜在空间中应用扩散模型,引入交叉注意力层到模型体系结构中,以更少的计算要求取得接近最优的性能,实现高分辨率合成,缩小像素级DMs对计算资源的需求。
Abstract
By decomposing the image formation process into a sequential application of denoising
autoencoders
,
diffusion models
(DMs) achieve state-of-the-art synthesis results on image data and beyond. Additionally, their
→