BriefGPT.xyz
Jun, 2019
Model selection for contextual bandits
Dylan J. Foster, Akshay Krishnamurthy, Haipeng Luo
TL;DR
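Introduces the problem of model selection for contextual bandits together with a solution; the solution applies to linear contextual bandits and is described as achieving a low posterior probability given prior knowledge.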
Abstract
We introduce the problem of model selection for contextual bandits, wherein a learner must adapt to the complexity of the optimal policy while balancing