BriefGPT.xyz
Oct, 2024
在多样化背景下学习定制文本到图像的扩散模型
Learning to Customize Text-to-Image Diffusion In Diverse Context
HTML
PDF
Taewook Kim, Wei Chen, Qiang Qiu
TL;DR
本研究解决了现有文本到图像定制技术因训练数据过于局限而导致的泛化能力不足的问题。作者提出通过构建丰富的文本提示集来多样化个人概念的上下文,从而显著提高了语义对齐,并提升了生成图像的保真度。该方法不需要架构修改,兼容现有的定制技术,扩展性强。
Abstract
Most
Text-to-Image
Customization
techniques fine-tune models on a small set of \emph{personal concept} images captured in minimal contexts. This often results in the model becoming overfitted to these training im
→