The correspondence between input text and the generated image exhibits
opacity, wherein minor textual modifications can induce substantial deviations
in the generated image. While, text embedding, as the pivotal intermediary
between text and images, remains relatively underexplored. In