In many quantum tasks, there is an unknown quantum object that one wishes to
learn. An online strategy for this task involves adaptively refining a
hypothesis to reproduce such an object or its measurement statistics. A common
evaluation metric for such a strategy is its regret, or rou