BriefGPT.xyz
Jun, 2024
LLM4GEN: Leveraging Semantic Representation of LLMs for Text-to-Image Generation
Mushui Liu, Yuhang Ma, Xinfeng Zhang, Yang Zhen, Zeng Zhao...
TL;DR
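LLM4GEN uses a Cross-Adapter module designed to incorporate LLM features, which improves semantic understanding of complex and dense prompts. It brings significant improvements to text-to-image generation and surpasses existing state-of-the-art models in sample quality, image-text alignment, and human evaluation.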
Abstract
Diffusion models have exhibited substantial success in text-to-image generation. However, they often encounter challenges when dealing with complex and