BriefGPT.xyz
Jun, 2021
核和神经赌博中的纯探索
Pure Exploration in Kernel and Neural Bandits
HTML
PDF
Yinglun Zhu, Dongruo Zhou, Ruoxi Jiang, Quanquan Gu, Rebecca Willett...
TL;DR
本文研究了一种新的纯探索选择策略,通过自适应地将每个手臂的特征表示嵌入到低维空间中并仔细处理引起的模型错误,成果展示了该方法在核空间或神经表示中实现的有效维度。实验证明了该方法的有效性。
Abstract
We study
pure exploration
in
bandits
, where the dimension of the feature representation can be much larger than the number of arms. To overcome the curse of dimensionality, we propose to adaptively embed the feat
→