BriefGPT.xyz
May, 2023
通过基于梯度的表示解释来消除模型偏差
Model Debiasing via Gradient-based Explanation on Representation
HTML
PDF
Jindi Zhang, Luning Wang, Dan Su, Yongxiang Huang, Caleb Chen Cao...
TL;DR
本文提出了一种基于代理属性与敏感属性的公平性框架,通过梯度解释找到模型关注点,再利用它们来指导下游任务模型的训练,以实现公平性和效用之间的平衡。实验证明,我们的框架在非分离和分离表示学习方法上具有更好的公平性-精度平衡。
Abstract
machine learning
systems produce biased results towards certain demographic groups, known as the
fairness problem
. Recent approaches to tackle this problem learn a latent code (i.e., representation) through disen
→