BriefGPT.xyz
Dec, 2023
TaCo:基于信息论和可解释性的自然语言处理中的目标概念删除
TaCo: Targeted Concept Removal in Output Embeddings for NLP via Information Theory and Explainability
HTML
PDF
Fanny Jourdan, Louis Béthune, Agustin Picard, Laurent Risser, Nicholas Asher
TL;DR
通过嵌入变换消除NLP模型中的隐性信息以减少性别相关联系,同时保留模型的整体性能和功能。
Abstract
The
fairness
of
natural language processing
(NLP) models has emerged as a crucial concern. Information theory indicates that to achieve
fairness<
→