Jun, 2023
Fixed-Budget Best-Arm Identification with Heterogeneous Reward Variances
Anusha Lalitha, Kousha Kalantari, Yifei Ma, Anoop Deoras, Branislav Kveton
TL;DR
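This paper studies best-arm identification in the fixed-budget setting with heterogeneous reward variances. It proposes two variance-adaptive algorithms: SHVar, for known reward variances, and SHAdaVar, for unknown reward variances. Both allocate the budget non-uniformly so that higher-variance arms receive more pulls. The paper also gives bounds on the probability of misidentifying the best arm.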
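To make the idea of a non-uniform, variance-aware budget split concrete, here is a minimal Python sketch for the known-variance case. It is not the paper's SHVar: the halving-style elimination, the proportional-to-variance split, the Gaussian rewards, and the name `variance_weighted_halving` are all assumptions made for illustration, since the summary does not spell out these details.

```python
import math
import random


def variance_weighted_halving(means, variances, budget, seed=0):
    """Illustrative sketch (hypothetical helper, not the paper's algorithm).

    Repeatedly halves the set of active arms. Within each stage, the stage
    budget is split in proportion to each arm's known reward variance, so
    noisier arms are pulled more often.
    """
    rng = random.Random(seed)
    active = list(range(len(means)))
    # Assumed schedule: one stage per halving, equal budget per stage.
    stages = max(1, math.ceil(math.log2(len(active))))
    per_stage = budget // stages

    while len(active) > 1:
        total_var = sum(variances[i] for i in active)
        estimates = {}
        for i in active:
            if total_var > 0:
                share = variances[i] / total_var
            else:
                share = 1.0 / len(active)  # fall back to a uniform split
            # Every active arm gets at least one pull.
            n_pulls = max(1, int(per_stage * share))
            # Simulated Gaussian rewards (an assumption of this sketch).
            rewards = [
                rng.gauss(means[i], math.sqrt(variances[i]))
                for _ in range(n_pulls)
            ]
            estimates[i] = sum(rewards) / n_pulls
        # Keep the better half of the arms by estimated mean reward.
        active.sort(key=lambda i: estimates[i], reverse=True)
        active = active[: math.ceil(len(active) / 2)]

    return active[0]


if __name__ == "__main__":
    best = variance_weighted_halving(
        means=[0.0, 0.1, 0.2, 1.0],
        variances=[0.1, 0.5, 1.0, 2.0],
        budget=4000,
    )
    print("Arm selected as best:", best)
```

With a proportional split, each arm's mean estimate ends up with roughly the same standard error within a stage, which is one plausible reason to favor high-variance arms. The unknown-variance case would need variance estimates in place of `variances`, which this sketch does not attempt.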
Abstract
We study the problem of best-arm identification (BAI) in the fixed-budget setting with heterogeneous reward variances. We propose two vari…