BriefGPT.xyz
Dec, 2024
使用软替代品预测长期顺序政策价值
Predicting Long Term Sequential Policy Value Using Softer Surrogates
HTML
PDF
Hyunji Nam, Allen Nie, Ge Gao, Vasilis Syrgkanis, Emma Brunskill
TL;DR
本研究解决了在教育、医疗和在线商业中政策评估面临的挑战,特别是需要长时间观察结果的问题。通过引入两种新的评估估计器,研究展示了如何仅利用短期数据和历史全周期数据,快速有效地估计新决策政策的全周期价值,从而在十倍速上实现比传统方法更快的政策评估。
Abstract
Performing
policy evaluation
in education,
healthcare
and online commerce can be challenging, because it can require waiting substantial amounts of time to observe outcomes over the desired horizon of interest. W
→