BriefGPT.xyz
Dec, 2023
利用反事实对齐方法识别虚假相关性
Identifying Spurious Correlations using Counterfactual Alignment
HTML
PDF
Joseph Paul Cohen, Louis Blankemeier, Akshay Chaudhari
TL;DR
通过计算输入不同分类器后输出的响应之间的关系,我们提出了反事实对齐方法来检测和探索黑盒分类器中的虚假相关性,并验证了在人脸属性分类器中检测到虚假相关性的能力,同时证明了可以通过CF对齐方法纠正分类器中检测到的虚假相关性。
Abstract
Models driven by
spurious correlations
often yield poor generalization performance. We propose the
counterfactual alignment method
to detect and explore
→