Inspired by the problem of improving classification accuracy on rare or hard
subsets of a population, there has been recent interest in models of learning
where the goal is to generalize to a collection of distributions, each
representing a ``group''. We consider a variant of this prob