Generating images from text has become easier with the scaling of
diffusion models and advances in vision-language research. These
models are trained on vast amounts of data from the Internet. Hence, they
often contain undesirable content such as copyrighted materi