BriefGPT.xyz
Feb, 2023
Near-Optimal Adversarial Reinforcement Learning with Switching Costs
Ming Shi, Yingbin Liang, Ness Shroff
TL;DR
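This paper addresses the question of how to develop a provably efficient adversarial RL algorithm with switching costs. It proposes two novel switching-reduced algorithms, and the regret analysis shows that their performance is near-optimal.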
Abstract
Switching costs, which capture the costs for changing policies, are regarded as a critical metric in reinforcement learning (RL), in addition to the standard metric of losses (or rewards). However, existing studi