BriefGPT.xyz
April 2024
Numeric Reward Machines
Kristina Levina, Nikolaos Pappas, Athanasios Karapantelakis, Aneta Vulgarakis Feljan, Jendrik Seipp
TL;DR
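Extending reward machines with numeric features significantly improves their effectiveness on numeric tasks and yields a significant advantage over baseline methods.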
Abstract
Reward machines inform reinforcement learning agents about the reward structure of the environment and often drastically speed up the learning ...