visual storytelling includes two important parts: coherence between the story
and images as well as the story structure. For image to text neural network
models, similar images in the sequence would provide close information for
story generator to obtain almost identical sentence. Howe