BriefGPT.xyz
Feb, 2022
语言数据影响上,先驱胜于追随
First is Better Than Last for Training Data Influence
HTML
PDF
Chih-Kuan Yeh, Ankur Taly, Mukund Sundararajan, Frederick Liu, Pradeep Ravikumar
TL;DR
该研究针对NLP应用中大型模型在调试训练数据和解释模型行为时计算影响力的问题,提出了一种名为TracIn-WE的技术,该技术基于词嵌入层进行数据影响力分析,能够获得较高的影响力得分,可有效调试。
Abstract
The ability to identify
influential training examples
enables us to debug training data and explain
model behavior
. Existing techniques are based on the flow of influence through the model parameters. For large m
→