BriefGPT.xyz
Dec, 2022
无需训练的结构扩散引导的组合文本到图像合成
Training-Free Structured Diffusion Guidance for Compositional Text-to-Image Synthesis
HTML
PDF
Weixi Feng, Xuehai He, Tsu-Jui Fu, Varun Jampani, Arjun Akula...
TL;DR
本文基于扩散模型的可控属性,将语言结构与扩散过程相结合,进一步提高了T2I模型的组合能力,特别是更准确的属性绑定和更好的图像组合,这得益于跨注意层的帮助和语言洞察力。
Abstract
Large-scale
diffusion models
have achieved state-of-the-art results on
text-to-image synthesis
(T2I) tasks. Despite their ability to generate high-quality yet creative images, we observe that attribution-binding
→