TL;DR提出了一种名为 Character Image Feature Encoder 的模型,它能够通过简单地提供角色图片来生成符合预期的人物角色图像,而不需要对每个个体 / 动画角色图像进行训练,可以方便地将其集成到现有的生成模型中。
Abstract
The current state-of-the-art diffusion model has demonstrated excellent
results in generating images. However, the images are monotonous and are mostly
the result of the distribution of images of people in the training set, making
it challenging to generate multiple images for a fixed