BriefGPT.xyz
Jul, 2023
线性赌博机中的即时模型选择
Anytime Model Selection in Linear Bandits
HTML
PDF
Parnian Kassraie, Aldo Pacchiano, Nicolas Emmenegger, Andreas Krause
TL;DR
在线学习在模型选择时可以通过对线性赌博机进行全信息反馈来改进性能,从而在M个模型中具有对数级的依赖性,而不需要先验知识或纯探索阶段。
Abstract
model selection
in the context of
bandit optimization
is a challenging problem, as it requires balancing exploration and exploitation not only for action selection, but also for
→