基础模型的嵌入表示或许能够检测分布偏移

Oct, 2023

Foundation Model's Embedded Representations May Detect Distribution Shift

Adam Tsou, Max Vargas, Andrew Engel, Tony Chiang

TL;DR通过在预训练的GPT-2模型上进行情感分类的迁移学习案例研究，我们发现训练集和测试集之间的分布变化使我们无法准确了解神经网络模型的泛化能力。

Abstract

distribution shifts between train and test datasets obscure our ability to understand the generalization capacity of →