基于生成对抗网络的文本到图像生成

Oct, 2024

基于生成对抗网络的文本到图像生成

Text-To-Image with Generative Adversarial Networks

Mehrshad Momen-Tayefeh

TL;DR本研究解决了从文本生成逼真图像这一计算机视觉领域的难题。通过对比五种基于生成对抗网络（GAN）的方法，本文揭示了不同模型架构在生成图像时的分辨率差异，并确定了最佳和最差的分辨率。这项研究的关键发现是识别了性能最优的模型，为文本到图像生成提供了指导。

Abstract

Generating realistic images from human texts is one of the most challenging problems in the field of Computer Vision (CV). The meaning of descriptions given can be roughly reflected by existing Text-to-Image appr