BriefGPT.xyz
Jun, 2023
Geometry-Aware Approaches for Balancing Performance and Theoretical Guarantees in Linear Bandits
Yuwei Luo, Mohsen Bayati
TL;DR
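The paper proposes a data-driven technique that uses the geometry of the uncertainty ellipsoid to track how well an algorithm is learning. This yields instance-dependent frequentist regret bounds across problem instances, which in turn make it possible to course-correct the base algorithm on the instances where it performs poorly, keeping regret to a minimum while retaining most of the base algorithm's desirable properties.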
Abstract
This paper is motivated by recent developments in the linear bandit literature, which have revealed a discrepancy between the promising empirical performance of algorithms such as Thompson sampling and Greedy, wh…