Even though deep neural models have achieved superhuman performance on many
popular benchmarks, they have failed to generalize to OOD or adversarial
datasets. Conventional approaches aimed at increasing robustness include
developing increasingly large models and augmentation with large