BriefGPT.xyz
Apr, 2024
MULAN:用于可控文本到图像生成的多层注释数据集
MULAN: A Multi Layer Annotated Dataset for Controllable Text-to-Image Generation
HTML
PDF
Petru-Daniel Tudosiu, Yongxin Yang, Shifeng Zhang, Fei Chen, Steven McDonagh...
TL;DR
将一幅单眼RGB图像分解成为一个包含背景和独立实例的RGBA层叠,并重建遮挡区域,为高质量图像提供实例分解和遮挡信息的第一个照片逼真资源,为文本到图像生成AI研究开辟新的可能性。
Abstract
text-to-image generation
has achieved astonishing results, yet precise
spatial controllability
and prompt fidelity remain highly challenging. This limitation is typically addressed through cumbersome prompt engin
→