Yonatan Mintz, Anil Aswani, Philip Kaminsky, Elena Flowers, Yoshimi Fukuoka
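TL;DR: Proposes the ROGUE (Reducing or Gaining Unknown Efficacy) class of models and the accompanying ROGUE-UCB algorithm, which capture problem settings that exhibit non-stationary phenomena. Experiments show that the approach outperforms existing algorithms, and it is applied to personalized healthcare interventions aimed at increasing physical activity.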
Abstract
Many settings require a decision maker to repeatedly choose from a set of interventions to apply to an individual without knowing the interventions' efficacy a priori. However, repeated application of a specific intervention may reduce its efficacy, while abstaining from applying an intervention may cause its efficacy to recover. Such phenomena are observed