CVaR估计的浓度界: 轻尾分布与重尾分布的情形

Jan, 2019

CVaR估计的浓度界: 轻尾分布与重尾分布的情形

Risk-aware Multi-armed Bandits Using Conditional Value-at-Risk

Ravi Kumar Kolla, Prashanth L A, Krishna Jagannathan

TL;DR该研究使用经验分布和截断法估算CVaR，得出其轻尾和重尾分布的集中界，并将其应用于多臂老虎机问题中，提出了基于CVaR的连续拒绝算法，并利用CVaR集中结果导出了算法错误识别概率的上界。

Abstract

Traditional multi-armed bandit problems are geared towards finding the arm with the highest expected value -- an objective that is risk-neutral. In several practical applications, e.g., finance, a risk-sensitive objective is to control the worst-case losses and →