BriefGPT.xyz
Mar, 2024
Refining Text-to-Image Generation: Towards Accurate Training-Free Glyph-Enhanced Image Generation
Sanyam Lakhanpal, Shivang Chopra, Vinija Jain, Aman Chadha, Man Luo
TL;DR
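A training-free framework that improves OCR performance on the LenCom-Eval and MARIO-Eval benchmarks, offering a new approach to generating images that contain long and rare text sequences.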
Abstract
Over the past few years, Text-to-Image (T2I) generation approaches based on diffusion models have gained significant attention. However, vanilla diffusion models often suffer from spelling inaccuracies in the tex…