BriefGPT.xyz
Oct, 2021
深度模型的一致对法推断
Consistent Counterfactuals for Deep Models
HTML
PDF
Emily Black, Zifan Wang, Matt Fredrikson, Anupam Datta
TL;DR
本篇论文研究了深度网络下,机器学习模型一致性与反事实样本的关系,在权重初始化和数据保留等初步训练条件发生微小变化时,提出了通过稳定邻近搜索算法来生成更一致解释的反事实样本的方法,并证明了该方法在多个基准数据集上具有有效性。
Abstract
Counterfactual examples are one of the most commonly-cited methods for explaining the predictions of
machine learning models
in key areas such as finance and medical diagnosis.
counterfactuals
are often discussed
→