BriefGPT.xyz
Jan, 2023
概率上任何时间安全的随机组合半臂匪
Probably Anytime-Safe Stochastic Combinatorial Semi-Bandits
HTML
PDF
Yunlong Hou, Vincent Y. F. Tan, Zixin Zhong
TL;DR
提出了 probably anytime-safe stochastic combinatorial semi-bandits 问题及其改善风险的算法 PASCombUCB,可应用于推荐系统和交通运输领域等代理人在单个时间步内选择多个项目并希望在整个时间范围内控制风险的情境。
Abstract
Motivated by concerns about making online decisions that incur undue amount of
risk
at each time step, in this paper, we formulate the probably anytime-safe stochastic
combinatorial
→