Mark Collier, Efi Kokiopoulou, Andrea Gesmundo, Jesse Berent
TL;DR使用稀疏路由网络和共同训练来提高不断学习的性能,其最终可以超过基于全连接网络的性能。
Abstract
The core challenge with continual learning is catastrophic forgetting, the phenomenon that when neural networks are trained on a sequence of tasks they rapidly forget previously learned tasks. It has been observe