Gianmarco Genalti, Lupo Marsigli, Nicola Gatti, Alberto Maria Metelli
TL;DR学习重尾分布下的遗憾最小化问题,引入自适应算法并提供适应性鲁棒 UCB 方法,以最小化重尾 MAB 问题的遗憾。
Abstract
heavy-tailed distributions naturally arise in many settings, from finance to telecommunications. While regret minimization under sub-Gaussian or bounded support rewards has been widely studied, learning on