BriefGPT.xyz
Sep, 2024
Rethinking The Training And Evaluation of Rich-Context Layout-to-Image Generation
Jiaxin Cheng, Zixu Zhao, Tong He, Tianjun Xiao, Yicong Zhou...
TL;DR
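This study addresses the poor performance of existing layout-to-image generation methods when the text descriptions are complex. It proposes a novel regional cross-attention module to enhance the generation process, along with new metrics for evaluating generation performance in open-vocabulary scenarios. The study finds that these metrics align closely with human preferences and have significant potential for application.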
Abstract
Recent advancements in generative models have significantly enhanced their capacity for image generation, enabling a wide range of applications such as image editing, completion and video editing. A specialized area within generative modeling is