Recent work has shown that deep learning models are prone to exploiting spurious
correlations that are present in the training set, yet may not hold true in
general. For example, a sentiment classifier may erroneously learn that the token "spielberg"
is always tied to positive movie reviews. Relying on