BriefGPT.xyz
Oct, 2024
MCGM:掩膜条件文本到图像生成模型
MCGM: Mask Conditional Text-to-Image Generative Model
HTML
PDF
Rami Skaik, Leonardo Rossi, Tomaso Fontanini, Andrea Prati
TL;DR
本研究解决了现有生成模型在生成特定姿势图像时的局限性。我们提出的掩膜条件文本到图像生成模型(MCGM)通过引入掩膜嵌入注入技术,提供对生成过程的灵活控制,使用户能够基于需求生成高质量图像。实验证明,MCGM有效提升了当前Break-a-scene生成模型的性能。
Abstract
Recent advancements in generative models have revolutionized the field of artificial intelligence, enabling the creation of highly-realistic and detailed images. In this study, we propose a novel Mask Conditional
Text-to-Image
→