BriefGPT.xyz
Jun, 2021
Optimal Model Selection in Contextual Bandits with Many Classes via Offline Oracles
Sanath Kumar Krishnamurthy, Susan Athey
TL;DR
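This work proposes a new algorithm for model selection in contextual bandits. Using an offline model selection oracle, the algorithm balances both the bias-variance trade-off and the exploration-exploitation trade-off, with the same computational requirements as model selection for regression.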
Abstract
We study the problem of model selection for contextual bandits, in which the algorithm must balance the bias-variance trade-off for model estimation while also balancing the ...