adaptive sequential decision making is one of the central challenges in
machine learning and artificial intelligence. In such problems, the goal is to
design an interactive policy that plans for an action to take, from a finite
set of $n$ actions, given some partial observations. It ha