BriefGPT.xyz
May, 2024
TIE: Revolutionizing Text-based Image Editing for Complex-Prompt Following and High-Fidelity Editing
Xinyu Zhang, Mengxue Kang, Fei Wei, Shuang Xu, Yuhe Liu...
TL;DR
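We propose an innovative image editing framework that uses the strong chain-of-thought reasoning and localization capabilities of multimodal large language models (MLLMs) to help diffusion models generate more refined images.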
Abstract
As the field of image generation rapidly advances, traditional diffusion models and those integrated with multimodal large language models...