BriefGPT.xyz
Mar, 2024
无监督预训练的泛化能力研究
On the Generalization Ability of Unsupervised Pretraining
HTML
PDF
Yuyang Deng, Junyuan Hong, Jiayu Zhou, Mehrdad Mahdavi
TL;DR
运用一种新的理论框架,研究无监督预训练对细调模型泛化能力的影响,并通过分析两个具体场景的泛化上限,提出了一种新的预训练正则化方法,从而促进了细调模型的泛化能力。
Abstract
Recent advances in
unsupervised learning
have shown that unsupervised
pre-training
, followed by
fine-tuning
, can improve model
→