BriefGPT.xyz
May, 2023
减缓长短期记忆网络的灾难性遗忘
Mitigating Catastrophic Forgetting in Long Short-Term Memory Networks
HTML
PDF
Ketaki Joshi, Raghavendra Pradyumna Pothukuchi, Andre Wibisono, Abhishek Bhattacharjee
TL;DR
本文研究在序列数据上的持续学习问题,重点讨论了LSTM网络的遗忘和多任务学习问题,并提出了两种有效的解决方案,证明了这种方法比现有的权重正则化方法更为简单、高效,可应用于计算机系统优化和自然语言处理等领域。
Abstract
continual learning
on
sequential data
is critical for many
machine learning
(ML) deployments. Unfortunately,
→