BriefGPT.xyz
Dec, 2020
具备对抗成本和已知转移的随机最短路径最小化遗憾
Minimax Regret for Stochastic Shortest Path with Adversarial Costs and Known Transition
HTML
PDF
Liyu Chen, Haipeng Luo, Chen-Yu Wei
TL;DR
研究用Online Mirror Descent 框架的各种新技术,包括改进的多尺度专家算法、从一般随机最短路径到特殊无环情况的降低、倾斜的占用度量空间以及添加到成本估计器的新校正项等,以解决带对手成本的随机最短路径问题并同时减小学习者方差和最优策略的偏差。
Abstract
We study the
stochastic shortest path
problem with
adversarial costs
and known transition, and show that the
minimax regret
is $\widetilde
→