BriefGPT.xyz
Jan, 2024
上下文固定预算的最佳臂识别:具有策略学习的自适应实验设计
Contextual Fixed-Budget Best Arm Identification: Adaptive Experimental Design with Policy Learning
HTML
PDF
Masahiro Kato, Kyohei Okumura, Takuya Ishihara, Toru Kitagawa
TL;DR
个性化治疗建议、最佳治疗方法鉴定、上下文信息、自适应实验以及策略学习是这篇研究论文的关键词,通过推荐最佳治疗方法的决策策略获得最小的预期简单后悔,同时为政策学习、实验设计和自适应福利最大化提供了新的方法。
Abstract
individualized treatment recommendation
is a crucial task in evidence-based decision-making. In this study, we formulate this task as a fixed-budget
best arm identification
(BAI) problem with
→