BriefGPT.xyz
Oct, 2023
利用生成的LLM的反事实文本来解释黑盒NLP模型
Faithful Explanations of Black-box NLP Models Using LLM-generated Counterfactuals
HTML
PDF
Yair Gat, Nitay Calderon, Amir Feder, Alexander Chapanin, Amit Sharma...
TL;DR
解释自然语言处理系统预测的因果解释对于确保安全性和建立信任至关重要,本文提出了两种针对模型无关性的倒因果估算方法,分别基于生成和匹配,并通过实验证明了生成模型和匹配模型在模型解释方面的出色性能。
Abstract
causal explanations
of the predictions of
nlp systems
are essential to ensure safety and establish trust. Yet, existing methods often fall short of explaining model predictions effectively or efficiently and are
→