BriefGPT.xyz
Oct, 2024
自我监督学习中的泛化与因果解释
On the Generalization and Causal Explanation in Self-Supervised Learning
HTML
PDF
Wenwen Qiang, Zeen Song, Ziyin Gu, Jiangmeng Li, Changwen Zheng...
TL;DR
本研究关注自我监督学习中的过拟合问题及其对模型适应新任务能力的影响。通过实验,我们观察到过拟合在后期层和训练后期突发,而特征泛化则在早期层通过引入解除记忆机制(UMM)得以改善。研究表明,UMM显著提升了自我监督学习方法在下游任务上的泛化性能。
Abstract
Self-supervised Learning
(SSL) methods learn from unlabeled data and achieve high
Generalization
performance on downstream tasks. However, they may also suffer from
→