深度学习模型的背景数据大小对SHapley Additive exPlanations（SHAP）稳定性的影响的实证研究 (An empirical study of the effect of background data size on the stability of SHapley Additive exPlanations (SHAP) for deep learning models) - 专知论文

会员服务 ·

0

实证研究 · 数据集 · 波动 · 深度学习模型 · 学习模型 ·

2023 年 4 月 9 日

An empirical study of the effect of background data size on the stability of SHapley Additive exPlanations (SHAP) for deep learning models

翻译：深度学习模型的背景数据大小对SHapley Additive exPlanations（SHAP）稳定性的影响的实证研究

Han Yuan,Mingxuan Liu,Lican Kang,Chenkui Miao,Ying Wu

Nowadays, the interpretation of why a machine learning (ML) model makes certain inferences is as crucial as the accuracy of such inferences. Some ML models like the decision tree possess inherent interpretability that can be directly comprehended by humans. Others like artificial neural networks (ANN), however, rely on external methods to uncover the deduction mechanism. SHapley Additive exPlanations (SHAP) is one of such external methods, which requires a background dataset when interpreting ANNs. Generally, a background dataset consists of instances randomly sampled from the training dataset. However, the sampling size and its effect on SHAP remain to be unexplored. In our empirical study on the MIMIC-III dataset, we show that the two core explanations - SHAP values and variable rankings fluctuate when using different background datasets acquired from random sampling, indicating that users cannot unquestioningly trust the one-shot interpretation from SHAP. Luckily, such fluctuation decreases with the increase of the background dataset size. Also, we notice an U-shape in the stability assessment of SHAP variable rankings, demonstrating that SHAP is more reliable in ranking the most and least important variables compared to moderately important ones. Overall, our results suggest that users should take into account how background data affects SHAP results, with improved SHAP stability as the background sample size increases.

翻译：现今，解释为何机器学习（ML）模型做出某些推论的解释方式和推论准确率一样重要。像决策树这样的ML模型具有直接被人理解的内在可解释性。然而，其他的模型，如人工神经网络（ANN）则依赖外部方法来揭示推导机制。SHapley Additive exPlanations（SHAP）就是这样一种需要背景数据集的外部方法来解释ANNs。一般情况下，背景数据集包含从训练数据集中随机抽取的实例。然而，抽样大小及其对SHap的影响仍未被研究。在我们对MIMIC-III数据集的实证研究中，我们展示了两种核心解释--SHAP值和变量排名--在使用从随机抽样获得的不同的背景数据集时会波动，这表明用户不能毫无保留地相信SHAP的一次性解释。幸运的是，此类波动随着背景数据集大小的增加而减少。此外，我们注意到SHAP变量排名稳定性评估中的U形演变，表明与适度重要变量相比，SHAP在排列最重要和最不重要的变量时更可靠。总体上，我们的结果表明，用户应该考虑背景数据如何影响SHAP的结果，随着背景样本大小的增加，SHAP的稳定性得到了改善。

0

相关内容

实证研究

【2022新书】高效深度学习，Efficient Deep Learning Book

【2022新书】高效深度学习，Efficient Deep Learning Book

专知会员服务

125+阅读 · 2022年4月21日

【2021干货书】Python可解释人工智能，207页pdf，Explainable AI with Python

【2021干货书】Python可解释人工智能，207页pdf，Explainable AI with Python

专知会员服务

186+阅读 · 2021年5月17日

回顾机器学习公平的数学框架，Review of Mathematical frameworks for Fairness in Machine Learning

回顾机器学习公平的数学框架，Review of Mathematical frameworks for Fairness in Machine Learning

专知会员服务

38+阅读 · 2020年5月30日

知识图嵌入和可解释人工智能 Knowledge Graph Embeddings and Explainable AI

知识图嵌入和可解释人工智能 Knowledge Graph Embeddings and Explainable AI

专知会员服务

136+阅读 · 2020年5月1日

【CHI2020-微软】解释可解释性:理解数据科学家使用机器学习的可解释性工具，Interpreting Interpretability: Understanding Data Scientists’Use of Interpretability Tools for Machine Learning

【CHI2020-微软】解释可解释性:理解数据科学家使用机器学习的可解释性工具，Interpreting Interpretability: Understanding Data Scientists’Use of Interpretability Tools for Machine Learning

专知会员服务

55+阅读 · 2020年3月8日

《可解释的机器学习-interpretable-ml》238页pdf

《可解释的机器学习-interpretable-ml》238页pdf

专知会员服务

208+阅读 · 2020年2月24日

【贝叶斯深度学习：一种基于模型的可解释方法】Bayesian deep learning: A model-based interpretable approach

【贝叶斯深度学习：一种基于模型的可解释方法】Bayesian deep learning: A model-based interpretable approach

专知会员服务

49+阅读 · 2020年1月1日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

深度学习高温蒸馏：Softmax With Temperature

深度学习高温蒸馏：Softmax With Temperature

PaperWeekly

2+阅读 · 2022年11月23日

论文浅尝 | Continual Learning for Named Entity Recognition

论文浅尝 | Continual Learning for Named Entity Recognition

开放知识图谱

1+阅读 · 2022年6月25日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

Multi-Task Learning的几篇综述文章

Multi-Task Learning的几篇综述文章

深度学习自然语言处理

15+阅读 · 2020年6月15日

【Awesome】最全的机器学习可解释性资料（machine-learning-interpretability）

【Awesome】最全的机器学习可解释性资料（machine-learning-interpretability）

专知

29+阅读 · 2019年3月1日

机器学习可解释性工具箱XAI

机器学习可解释性工具箱XAI

专知

11+阅读 · 2019年2月8日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

一位ML工程师构建深度神经网络的实用技巧

一位ML工程师构建深度神经网络的实用技巧

AI100

11+阅读 · 2018年9月12日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

【论文推荐】最新7篇视觉问答（VQA）相关论文—解释、读写记忆网络、逆视觉问答、视觉推理、可解释性、注意力机制、计数

【论文推荐】最新7篇视觉问答（VQA）相关论文—解释、读写记忆网络、逆视觉问答、视觉推理、可解释性、注意力机制、计数

专知

30+阅读 · 2018年3月22日

纵向数据因果推断中的双稳健半参数效应模型研究

国家自然科学基金

5+阅读 · 2014年12月31日

关于面板(纵向）数据的动态统计分析

国家自然科学基金

0+阅读 · 2014年12月31日

地下水流数值模拟概念模型的不确定性分析

国家自然科学基金

0+阅读 · 2013年12月31日

全球气候变化下黄脊竹蝗Ceracris kiangsu Tsai发生的物候学模型

国家自然科学基金

0+阅读 · 2013年12月31日

基于秩次的有序分类纵向数据非参数方法研究

国家自然科学基金

0+阅读 · 2013年12月31日

统计学习理论中的分位数回归和MEE算法

国家自然科学基金

1+阅读 · 2012年12月31日

藤黄酸抗B细胞非霍奇金淋巴瘤新机制- - 调控SRC-3/组蛋白乙酰化转录复合物SUMO化修饰

国家自然科学基金

0+阅读 · 2012年12月31日

基于纵向数据的秩回归和分位数回归的有效参数估计

国家自然科学基金

0+阅读 · 2012年12月31日

初值扰动方法及其对短期气候集合预测的影响

国家自然科学基金

1+阅读 · 2011年12月31日

复杂模型的变量选择及其在流行病学中的应用

国家自然科学基金

0+阅读 · 2009年12月31日

CALIME: Causality-Aware Local Interpretable Model-Agnostic Explanations

Arxiv

0+阅读 · 2023年5月25日

Quantifying the Intrinsic Usefulness of Attributional Explanations for Graph Neural Networks with Artificial Simulatability Studies

Arxiv

0+阅读 · 2023年5月25日

FairShap: A Data Re-weighting Approach for Algorithmic Fairness based on Shapley Values

Arxiv

0+阅读 · 2023年5月24日

Learning the String Partial Order

Arxiv

0+阅读 · 2023年5月24日

Active Learning for Natural Language Generation

Arxiv

0+阅读 · 2023年5月24日

Interpretation and visualization of distance covariance through additive decomposition of correlations formula

Arxiv

0+阅读 · 2023年5月24日

DF2M: An Explainable Deep Bayesian Nonparametric Model for High-Dimensional Functional Time Series

Arxiv

0+阅读 · 2023年5月23日

A Survey of Learning on Small Data

Arxiv

19+阅读 · 2022年7月29日

Interpretable Machine Learning: Fundamental Principles and 10 Grand Challenges

Arxiv

17+阅读 · 2021年7月10日

Visual Interpretability for Deep Learning: a Survey

Arxiv

16+阅读 · 2018年2月7日

VIP会员

文章信息

相关主题

深度学习模型

相关VIP内容

【2022新书】高效深度学习，Efficient Deep Learning Book

【2022新书】高效深度学习，Efficient Deep Learning Book

专知会员服务

125+阅读 · 2022年4月21日

【2021干货书】Python可解释人工智能，207页pdf，Explainable AI with Python

【2021干货书】Python可解释人工智能，207页pdf，Explainable AI with Python

专知会员服务

186+阅读 · 2021年5月17日

回顾机器学习公平的数学框架，Review of Mathematical frameworks for Fairness in Machine Learning

回顾机器学习公平的数学框架，Review of Mathematical frameworks for Fairness in Machine Learning

专知会员服务

38+阅读 · 2020年5月30日

知识图嵌入和可解释人工智能 Knowledge Graph Embeddings and Explainable AI

知识图嵌入和可解释人工智能 Knowledge Graph Embeddings and Explainable AI

专知会员服务

136+阅读 · 2020年5月1日

【CHI2020-微软】解释可解释性:理解数据科学家使用机器学习的可解释性工具，Interpreting Interpretability: Understanding Data Scientists’Use of Interpretability Tools for Machine Learning

【CHI2020-微软】解释可解释性:理解数据科学家使用机器学习的可解释性工具，Interpreting Interpretability: Understanding Data Scientists’Use of Interpretability Tools for Machine Learning

专知会员服务

55+阅读 · 2020年3月8日

《可解释的机器学习-interpretable-ml》238页pdf

《可解释的机器学习-interpretable-ml》238页pdf

专知会员服务

208+阅读 · 2020年2月24日

【贝叶斯深度学习：一种基于模型的可解释方法】Bayesian deep learning: A model-based interpretable approach

【贝叶斯深度学习：一种基于模型的可解释方法】Bayesian deep learning: A model-based interpretable approach

专知会员服务

49+阅读 · 2020年1月1日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

兵棋系统文档：联合战区级模拟-全球行动（JTLS-GO®）

【普林斯顿博士论文】面向人本机器人学的安全与学习博弈论融合

从无人机到数据：揭示边缘计算作为新作战域

综述：机器嗅觉与嵌入式人工智能正在塑造新的全球传感产业

相关资讯

深度学习高温蒸馏：Softmax With Temperature

深度学习高温蒸馏：Softmax With Temperature

PaperWeekly

2+阅读 · 2022年11月23日

论文浅尝 | Continual Learning for Named Entity Recognition

论文浅尝 | Continual Learning for Named Entity Recognition

开放知识图谱

1+阅读 · 2022年6月25日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

Multi-Task Learning的几篇综述文章

Multi-Task Learning的几篇综述文章

深度学习自然语言处理

15+阅读 · 2020年6月15日

【Awesome】最全的机器学习可解释性资料（machine-learning-interpretability）

【Awesome】最全的机器学习可解释性资料（machine-learning-interpretability）

专知

29+阅读 · 2019年3月1日

机器学习可解释性工具箱XAI

机器学习可解释性工具箱XAI

专知

11+阅读 · 2019年2月8日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

一位ML工程师构建深度神经网络的实用技巧

一位ML工程师构建深度神经网络的实用技巧

AI100

11+阅读 · 2018年9月12日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

【论文推荐】最新7篇视觉问答（VQA）相关论文—解释、读写记忆网络、逆视觉问答、视觉推理、可解释性、注意力机制、计数

【论文推荐】最新7篇视觉问答（VQA）相关论文—解释、读写记忆网络、逆视觉问答、视觉推理、可解释性、注意力机制、计数

专知

30+阅读 · 2018年3月22日

相关论文

CALIME: Causality-Aware Local Interpretable Model-Agnostic Explanations

Arxiv

0+阅读 · 2023年5月25日

Quantifying the Intrinsic Usefulness of Attributional Explanations for Graph Neural Networks with Artificial Simulatability Studies

Arxiv

0+阅读 · 2023年5月25日

FairShap: A Data Re-weighting Approach for Algorithmic Fairness based on Shapley Values

Arxiv

0+阅读 · 2023年5月24日

Learning the String Partial Order

Arxiv

0+阅读 · 2023年5月24日

Active Learning for Natural Language Generation

Arxiv

0+阅读 · 2023年5月24日

Interpretation and visualization of distance covariance through additive decomposition of correlations formula

Arxiv

0+阅读 · 2023年5月24日

DF2M: An Explainable Deep Bayesian Nonparametric Model for High-Dimensional Functional Time Series

Arxiv

0+阅读 · 2023年5月23日

A Survey of Learning on Small Data

Arxiv

19+阅读 · 2022年7月29日

Interpretable Machine Learning: Fundamental Principles and 10 Grand Challenges

Arxiv

17+阅读 · 2021年7月10日

Visual Interpretability for Deep Learning: a Survey

Arxiv

16+阅读 · 2018年2月7日

相关基金

纵向数据因果推断中的双稳健半参数效应模型研究

国家自然科学基金

5+阅读 · 2014年12月31日

关于面板(纵向）数据的动态统计分析

国家自然科学基金

0+阅读 · 2014年12月31日

地下水流数值模拟概念模型的不确定性分析

国家自然科学基金

0+阅读 · 2013年12月31日

全球气候变化下黄脊竹蝗Ceracris kiangsu Tsai发生的物候学模型

国家自然科学基金

0+阅读 · 2013年12月31日

基于秩次的有序分类纵向数据非参数方法研究

国家自然科学基金

0+阅读 · 2013年12月31日

统计学习理论中的分位数回归和MEE算法

国家自然科学基金

1+阅读 · 2012年12月31日

藤黄酸抗B细胞非霍奇金淋巴瘤新机制- - 调控SRC-3/组蛋白乙酰化转录复合物SUMO化修饰

国家自然科学基金

0+阅读 · 2012年12月31日

基于纵向数据的秩回归和分位数回归的有效参数估计

国家自然科学基金

0+阅读 · 2012年12月31日

初值扰动方法及其对短期气候集合预测的影响

国家自然科学基金

1+阅读 · 2011年12月31日

复杂模型的变量选择及其在流行病学中的应用

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员