BriefGPT.xyz
Jul, 2023
通过主动遗忘预训练以提高语言可塑性
Improving Language Plasticity via Pretraining with Active Forgetting
HTML
PDF
Yihong Chen, Kelly Marchisio, Roberta Raileanu, David Ifeoluwa Adelani, Pontus Stenetor...
TL;DR
本文提出使用主动遗忘机制作为预训练过程中的一种简单方法,以创建能够快速适应新语言的PLMs。实验证明,与标准模型相比,在资源匮乏的情况下,使用遗忘机制的预先训练模型不仅在语言适应过程中表现出更快的收敛速度,而且在特别是对于与英语不同的语言来说表现更佳。
Abstract
pretrained language models
(PLMs) are today the primary model for
natural language processing
. Despite their impressive downstream performance, it can be difficult to apply PLMs to new languages, a barrier to mak
→