In this paper, we study the problem of outlier arm detection in multi-armed
bandit settings, which finds plenty of applications in many high-impact domains
such as finance, healthcare, and online advertising. For this problem, a
learner aims to identify the arms whose expected rewards