A few-shot generative model should be able to generate data from a novel
distribution by only observing a limited set of examples. In few-shot learning
the model is trained on data from many sets from distributions sharing some
underlying properties such as sets of characters from diff