BriefGPT.xyz
Apr, 2025
MiMu:减轻变换器的多重捷径学习行为
MiMu: Mitigating Multiple Shortcut Learning Behavior of Transformers
HTML
PDF
Lili Zhao, Qi Liu, Wei Chen, Liyi Chen, Ruijun Sun...
TL;DR
本研究解决了现有模型在学习过程中依赖特征与标签之间的虚假相关性,导致捷径学习行为的问题。提出的MiMu方法通过自校准和自改进策略,减轻模型对多种捷径的依赖,从而提高其稳健性和泛化性能。实验结果显示,该方法在自然语言处理和计算机视觉任务中显著提升了模型的鲁棒性和泛化能力。
Abstract
Empirical Risk Minimization (ERM) models often rely on spurious correlations between features and labels during the learning process, leading to
Shortcut Learning
behavior that undermines
Robustness
→