BriefGPT.xyz
May, 2022
具备知识整合的记忆高效强化学习
Memory-efficient Reinforcement Learning with Knowledge Consolidation
HTML
PDF
Qingfeng Lan, Yangchen Pan, Jun Luo, A. Rupam Mahmood
TL;DR
我们提出了一种基于深度 Q 网络算法的记忆效率强化学习算法,通过从目标 Q 网络到当前 Q 网络合并知识,减少遗忘并保持高的样本效率。与基线方法相比,在特征和图像任务中取得了相当或更好的性能,同时减轻了大经验重放缓冲区的负担。
Abstract
artificial neural networks
are promising as general function approximators but challenging to train on non-independent and identically distributed data due to
catastrophic forgetting
. Experience replay, a standar
→