BriefGPT.xyz
Sep, 2023
通过自然语言指导的语义探索提高深度强化学习的效率
Improve the efficiency of deep reinforcement learning through semantic exploration guided by natural language
HTML
PDF
Zhourui Guo, Meng Yao, Yang Yu, Qiyue Yin
TL;DR
用检索式方法通过神经网络编码,选择性、高效地与oracle进行交互,并使用oracle的答案更新agent的策略和值函数,从而在强化学习任务中大幅提高效率。
Abstract
reinforcement learning
is a powerful technique for learning from trial and error, but it often requires a large number of
interactions
to achieve good performance. In some domains, such as sparse-reward tasks, an
→