BriefGPT.xyz
Mar, 2021
模型可解释性的对照解释
Contrastive Explanations for Model Interpretability
HTML
PDF
Alon Jacovi, Swabha Swayamdipta, Shauli Ravfogel, Yanai Elazar, Yejin Choi...
TL;DR
该研究提出了一种利用潜空间对分类模型进行对比解释的方法,可以对输入的文本进行高、低级别的概念和属性归纳分析,以实现更准确、细粒度的模型可解释性。
Abstract
contrastive explanations
clarify why an event occurred in contrast to another. They are more inherently intuitive to humans to both produce and comprehend. We propose a methodology to produce
contrastive explanations
→