BriefGPT.xyz
Jul, 2023
可验证特征归因:后解释性与内在可解释性之间的桥梁
Verifiable Feature Attributions: A Bridge between Post Hoc Explainability and Inherent Interpretability
HTML
PDF
Usha Bhalla, Suraj Srinivas, Himabindu Lakkaraju
TL;DR
通过VerT方法,将黑盒模型转化为生成可信且可验证特征归因的模型,从而弥合了先前研究中的解释策略差距。
Abstract
With the increased deployment of
machine learning models
in various real-world applications, researchers and practitioners alike have emphasized the need for
explanations
of model behaviour. To this end, two broa
→