BriefGPT.xyz
Mar, 2023
Knowledge-Based Counterfactual Queries for Visual Question Answering
Theodoti Stoikou, Maria Lymperaiou, Giorgos Stamou
TL;DR
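This paper uses structured knowledge bases to perform deterministic, optimal, and controllable word-level substitutions in order to probe the explainability and robustness of VQA model behavior. From the model's responses to these counterfactual queries it extracts local and global explanations, surfacing both expected and unexpected patterns that affect model performance and revealing potential biases in the model's decision-making process.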
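To make the idea of a knowledge-based word-level substitution concrete, here is a minimal sketch. It is not the authors' implementation: the toy taxonomy `PARENT`, the helper names, and the choice of shortest path length as the notion of "optimal" are all assumptions made for illustration. The sketch picks, for each known concept in a question, the closest candidate concept in the taxonomy and breaks ties alphabetically so that the result is deterministic.

```python
from collections import deque

# Hypothetical toy taxonomy (child -> parent), standing in for a real knowledge base.
PARENT = {
    "dog": "mammal",
    "cat": "mammal",
    "mammal": "animal",
    "bird": "animal",
    "car": "vehicle",
    "bus": "vehicle",
    "animal": "entity",
    "vehicle": "entity",
}
NODES = set(PARENT) | set(PARENT.values())


def neighbors(node):
    """Return the parent and the children of a concept (undirected view)."""
    adjacent = {child for child, parent in PARENT.items() if parent == node}
    if node in PARENT:
        adjacent.add(PARENT[node])
    return adjacent


def distances(source):
    """Breadth-first search: path length from source to every reachable concept."""
    dist = {source: 0}
    queue = deque([source])
    while queue:
        current = queue.popleft()
        for nxt in sorted(neighbors(current)):
            if nxt not in dist:
                dist[nxt] = dist[current] + 1
                queue.append(nxt)
    return dist


def substitute(word, candidates):
    """Pick the closest candidate (assumed meaning of 'optimal'); ties go alphabetically."""
    dist = distances(word)
    reachable = [c for c in candidates if c != word and c in dist]
    if not reachable:
        return None
    return min(reachable, key=lambda c: (dist[c], c))


def counterfactual_question(question, candidates):
    """Replace every word that is a known concept with its closest candidate."""
    words = question.rstrip("?").split()
    swapped = []
    for w in words:
        replacement = substitute(w, candidates) if w in NODES else None
        swapped.append(replacement or w)
    return " ".join(swapped) + "?"


candidates = ["bird", "bus", "car", "cat", "dog"]
print(counterfactual_question("What color is the dog?", candidates))
```

Restricting `candidates` is what makes the substitution controllable in this sketch: a caller can limit replacements to, say, animals only. Comparing a VQA model's answers on the original and the substituted question would then be the basis for the local and global explanations mentioned above.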
Abstract
Visual question answering (VQA) has been a popular task that combines vision and language, with numerous relevant implementations in literature. Even though there are some attempts that approach explainability an…