AbstractAs opposed to standard empirical risk minimization (ERM),
distributionally robust optimization aims to minimize the worst-case risk over a larger ambiguity set containing the original empirical distribution of the training data. In this work, we describe a
→