使用 " 白箱模式 " 评估当地解释 (Evaluating Local Explanations using White-box Models)

Evaluating explanation techniques using human subjects is costly, time-consuming and can lead to subjectivity in the assessments. To evaluate the accuracy of local explanations, we require access to the true feature importance scores for a given instance. However, the prediction function of a model usually does not decompose into linear additive terms that indicate how much a feature contributes to the output. In this work, we suggest to instead focus on the log odds ratio (LOR) of the prediction function, which naturally decomposes into additive terms for logistic regression and naive Bayes. We demonstrate how we can benchmark different explanation techniques in terms of their similarity to the LOR scores based on our proposed approach. In the experiments, we compare prominent local explanation techniques and find that the performance of the techniques can depend on the underlying model, the dataset, which data point is explained, the normalization of the data and the similarity metric.

翻译：使用人类实验品来评价解释技术是昂贵的,耗时的,并可能导致评估的主观性。为了评价当地解释的准确性,我们需要获得某个特定例子的真正特征重要分数。然而,模型的预测功能通常不会分解成线性添加术语,以表明某一特征对产出的贡献程度。在这项工作中,我们建议把重点放在预测功能的日志概率比(LOR)上,这自然会分解成物流回归和天真的贝耶斯的添加术语。我们证明我们如何能够根据我们提议的方法,将不同解释技术与LOR分数的相似性作为基准。在实验中,我们比较了突出的地方解释技术,发现这些技术的性能取决于基本模型、数据集、数据点的解释、数据的正常化和类似性衡量标准。

相关内容

白盒

关注 0

白盒测试（也称为透明盒测试，玻璃盒测试，透明盒测试和结构测试）是一种软件测试方法，用于测试应用程序的内部结构或功能，而不是其功能（即黑盒测试）。在白盒测试中，系统的内部视角以及编程技能被用来设计测试用例。测试人员选择输入以遍历代码的路径并确定预期的输出。这类似于测试电路中的节点，在线测试（ICT）。白盒测试可以应用于软件测试过程的单元，集成和系统级别。尽管传统的测试人员倾向于将白盒测试视为在单元级别进行的，但如今它已越来越频繁地用于集成和系统测试。它可以测试单元内的路径，集成期间单元之间的路径以及系统级测试期间子系统之间的路径。

【PAISS 2021 教程】概率散度与生成式模型，92页ppt

专知会员服务

34+阅读 · 2021年11月30日

NLP必读经典文献100篇

专知会员服务

124+阅读 · 2020年9月8日

史上最全！358篇机器学习&自然语言处理综述论文！都这儿了

专知会员服务

129+阅读 · 2020年7月18日

因果图，Causal Graphs，52页ppt

专知会员服务

250+阅读 · 2020年4月19日