BriefGPT.xyz
Nov, 2024
利用可靠随机种子增强组合文本到图像生成
Enhancing Compositional Text-to-Image Generation with Reliable Random Seeds
HTML
PDF
Shuangqi Li, Hieu Le, Jingyi Xu, Mathieu Salzmann
TL;DR
本研究解决了文本到图像生成模型在处理组合提示(如“两个狗”或“碗右侧的企鹅”)时产生不一致结果的问题。我们提出了一种挖掘可靠噪声模式的方法,创建了无须人工标注的训练集,通过微调模型显著提高了其组合能力,特别是在数值组合与空间组合方面取得了显著提升。
Abstract
Text-to-image
Diffusion Models
have demonstrated remarkable capability in generating realistic images from arbitrary text prompts. However, they often produce inconsistent results for
Compositional Prompts
such a
→