TL;DR在 GAN 生成模型中学习文本和图像表示之间的语义对齐以缓解文本图像语义不匹配问题,进而生成连贯、高质量的多句故事可视化。
Abstract
story visualization aims to generate a sequence of images to narrate each
sentence in a multi-sentence story, where the images should be realistic and
keep global consistency across dynamic scenes and characters.