BriefGPT.xyz
Jun, 2023
带补给背包的强盗问题:两全其美
Bandits with Replenishable Knapsacks: the Best of both Worlds
HTML
PDF
Martino Bernasconi, Matteo Castiglioni, Andrea Celli, Federico Fusco
TL;DR
该研究提出了一种BwK框架的一般化模型,允许非单调资源利用,并提出了一个灵活的双重模板以处理任何具有再生性问题的在线学习问题,包括对抗和随机输入,同时可用于解决一些实际相关的经济问题。
Abstract
The bandits with knapsack (BwK) framework models
online decision-making
problems in which an agent makes a sequence of decisions subject to
resource consumption constraints
. The traditional model assumes that eac
→