BriefGPT.xyz
May, 2022
基于安全性的分段独立同分布赌博机变点检测
Safety Aware Changepoint Detection for Piecewise i.i.d. Bandits
HTML
PDF
Subhojyoti Mukherjee
TL;DR
本文考虑在安全约束下,针对分段独立同分布赌博机的问题,引入了适应性算法,探测并重新开始实验,同时提供了相应的遗憾上界和匹配下界。实验表明,相较于不符合安全约束的算法,本文提出的带安全约束的算法性能相似。
Abstract
In this paper, we consider the setting of
piecewise i.i.d. bandits
under a
safety constraint
. In this piecewise i.i.d. setting, there exists a finite number of
→