BriefGPT.xyz
Oct, 2019
A Narration-based Reward Shaping Approach using Grounded Natural Language Commands
Nicholas Waytowich, Sean L. Barton, Vernon Lawhern, Garrett Warnell
TL;DR
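Using natural language guidance, we improve deep reinforcement learning techniques, enabling effective training on tasks such as StarCraft II and achieving better performance than traditional reward shaping methods.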
Abstract
While deep reinforcement learning techniques have led to agents that are successfully able to learn to perform a number of tasks that had been previously unlearnable, these techniques are still susceptible to the longstanding problem of