In many real-world applications, we want to exploit multiple source datasets
of similar tasks to learn a model for a different but related target dataset --
e.g., recognizing characters of a new font using a set of different fonts.
While most recent research has considered ad-hoc combi