BriefGPT.xyz
Jul, 2024
利用大型语言模型的背景知识提高强化学习的样本效率
Improving Sample Efficiency of Reinforcement Learning with Background Knowledge from Large Language Models
HTML
PDF
Fuxiang Zhang, Junyou Li, Yi-Chen Li, Zongzhang Zhang, Yang Yu...
TL;DR
用大型语言模型(DLLM)提取环境背景知识的框架,可在多个强化学习任务中提高样本效率。
Abstract
Low
sample efficiency
is an enduring challenge of
reinforcement learning
(RL). With the advent of versatile
large language models
(LLMs),
→