从人类解释到模型解释:以证据的权衡为基础的框架 (From Human Explanation to Model Interpretability: A Framework Based on Weight of Evidence) - 专知论文

会员服务 ·

0

Weight · INFORMS · 估计/估计量 · MoDELS · Principle ·

2021 年 9 月 20 日

From Human Explanation to Model Interpretability: A Framework Based on Weight of Evidence

翻译：从人类解释到模型解释:以证据的权衡为基础的框架

David Alvarez-Melis,Harmanpreet Kaur,Hal Daumé III,Hanna Wallach,Jennifer Wortman Vaughan

from arxiv, HCOMP 2021

We take inspiration from the study of human explanation to inform the design and evaluation of interpretability methods in machine learning. First, we survey the literature on human explanation in philosophy, cognitive science, and the social sciences, and propose a list of design principles for machine-generated explanations that are meaningful to humans. Using the concept of weight of evidence from information theory, we develop a method for generating explanations that adhere to these principles. We show that this method can be adapted to handle high-dimensional, multi-class settings, yielding a flexible framework for generating explanations. We demonstrate that these explanations can be estimated accurately from finite samples and are robust to small perturbations of the inputs. We also evaluate our method through a qualitative user study with machine learning practitioners, where we observe that the resulting explanations are usable despite some participants struggling with background concepts like prior class probabilities. Finally, we conclude by surfacing~design~implications for interpretability tools in general.

翻译：我们从人类解释的研究中得到灵感,为机器学习解释方法的设计和评价提供参考。首先,我们调查哲学、认知科学和社会科学中人类解释的文献,提出对人类有意义的机器解释的设计原则清单。我们利用信息理论中证据的权重概念,开发了符合这些原则的解释方法。我们表明,这种方法可以适应高维、多级设置,产生一个灵活的解释框架。我们证明,这些解释可以从有限的样本中准确估计,并且能够对投入进行小的干扰。我们还通过与机器学习实践者进行质量用户研究来评估我们的方法。我们发现,尽管一些参与者在与前几类概率等背景概念斗争,但由此产生的解释仍然有用。最后,我们通过对一般可解释工具的表面设计来得出结论。

0

相关内容

Weight

【CHI2020-微软】解释可解释性:理解数据科学家使用机器学习的可解释性工具，Interpreting Interpretability: Understanding Data Scientists’Use of Interpretability Tools for Machine Learning

【CHI2020-微软】解释可解释性:理解数据科学家使用机器学习的可解释性工具，Interpreting Interpretability: Understanding Data Scientists’Use of Interpretability Tools for Machine Learning

专知会员服务

55+阅读 · 2020年3月8日

【ECML-PKDD 2019】突破可解释性障碍——解释深度图卷积模型的一种方法（Breaking the interpretability barrier - a methodfor interpreting deep graph convolutional models）

【ECML-PKDD 2019】突破可解释性障碍——解释深度图卷积模型的一种方法（Breaking the interpretability barrier - a methodfor interpreting deep graph convolutional models）

专知会员服务

19+阅读 · 2019年12月1日

【ECML-PKDD 2019】基于bagged-trees学习的可解释生存梯度提升模型（Interpretable survival gradient boosting models with bagged trees base learners）

【ECML-PKDD 2019】基于bagged-trees学习的可解释生存梯度提升模型（Interpretable survival gradient boosting models with bagged trees base learners）

专知会员服务

6+阅读 · 2019年12月1日

【伯克利PNAS最新论文】可解释机器学习的定义、方法和应用（Definitions, methods, and applications in interpretable machine learning）,W. James Murdoch,Chandan Singh

【伯克利PNAS最新论文】可解释机器学习的定义、方法和应用（Definitions, methods, and applications in interpretable machine learning）,W. James Murdoch,Chandan Singh

专知会员服务

55+阅读 · 2019年11月20日

【ICCV 2019 Toturial】Interpretable Machine Learning for Computer Vision（用于计算机视觉的可解释性机器学习）

【ICCV 2019 Toturial】Interpretable Machine Learning for Computer Vision（用于计算机视觉的可解释性机器学习）

专知会员服务

32+阅读 · 2019年10月30日

Risk Sensitive Portfolio Optimization with Regime-Switching and Default Contagion，香港理工大学应用数学系余翔助理教授，第八届全国社会媒体处理大会SMP2019

Risk Sensitive Portfolio Optimization with Regime-Switching and Default Contagion，香港理工大学应用数学系余翔助理教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

10+阅读 · 2019年10月24日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

Successor representations 强化学习表示的生物学启发

Successor representations 强化学习表示的生物学启发

CreateAMind

6+阅读 · 2019年9月5日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

【Awesome】最全的机器学习可解释性资料（machine-learning-interpretability）

【Awesome】最全的机器学习可解释性资料（machine-learning-interpretability）

专知

29+阅读 · 2019年3月1日

CCF C类 | IJCNN 2019 Special Section : 信息论与深度学习

CCF C类 | IJCNN 2019 Special Section : 信息论与深度学习

Call4Papers

5+阅读 · 2018年12月7日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

【论文推荐】最新7篇视觉问答（VQA）相关论文—解释、读写记忆网络、逆视觉问答、视觉推理、可解释性、注意力机制、计数

【论文推荐】最新7篇视觉问答（VQA）相关论文—解释、读写记忆网络、逆视觉问答、视觉推理、可解释性、注意力机制、计数

专知

30+阅读 · 2018年3月22日

最新5篇生成对抗网络相关论文推荐—FusedGAN、DeblurGAN、AdvGAN、CipherGAN、MMD GANS

最新5篇生成对抗网络相关论文推荐—FusedGAN、DeblurGAN、AdvGAN、CipherGAN、MMD GANS

专知

23+阅读 · 2018年1月18日

计算机视觉近一年进展综述

计算机视觉近一年进展综述

机器学习研究会

9+阅读 · 2017年11月25日

深度学习医学图像分析文献集

深度学习医学图像分析文献集

机器学习研究会

19+阅读 · 2017年10月13日

可解释的CNN

可解释的CNN

CreateAMind

17+阅读 · 2017年10月5日

Can Information Flows Suggest Targets for Interventions in Neural Circuits?

Arxiv

0+阅读 · 2021年11月9日

Robust walking based on MPC with viability guarantees

Arxiv

0+阅读 · 2021年11月9日

Counterfactual Explanations as Interventions in Latent Space

Arxiv

0+阅读 · 2021年11月8日

NeurInt : Learning to Interpolate through Neural ODEs

Arxiv

0+阅读 · 2021年11月7日

Interpretable, predictive spatio-temporal models via enhanced Pairwise Directions Estimation

Arxiv

0+阅读 · 2021年11月6日

Trustworthy AI: From Principles to Practices

Arxiv

46+阅读 · 2021年10月4日

Towards Rigorous Interpretations: a Formalisation of Feature Attribution

Arxiv

4+阅读 · 2021年4月26日

Interpretable machine learning: definitions, methods, and applications

Interpretable machine learning: definitions, methods, and applications

Arxiv

19+阅读 · 2019年1月14日

Learning with Interpretable Structure from RNN

Arxiv

19+阅读 · 2018年10月25日

Interpretable Active Learning

Interpretable Active Learning

Arxiv

3+阅读 · 2018年6月24日

VIP会员

文章信息

相关主题

估计/估计量

相关VIP内容

【CHI2020-微软】解释可解释性:理解数据科学家使用机器学习的可解释性工具，Interpreting Interpretability: Understanding Data Scientists’Use of Interpretability Tools for Machine Learning

【CHI2020-微软】解释可解释性:理解数据科学家使用机器学习的可解释性工具，Interpreting Interpretability: Understanding Data Scientists’Use of Interpretability Tools for Machine Learning

专知会员服务

55+阅读 · 2020年3月8日

【ECML-PKDD 2019】突破可解释性障碍——解释深度图卷积模型的一种方法（Breaking the interpretability barrier - a methodfor interpreting deep graph convolutional models）

【ECML-PKDD 2019】突破可解释性障碍——解释深度图卷积模型的一种方法（Breaking the interpretability barrier - a methodfor interpreting deep graph convolutional models）

专知会员服务

19+阅读 · 2019年12月1日

【ECML-PKDD 2019】基于bagged-trees学习的可解释生存梯度提升模型（Interpretable survival gradient boosting models with bagged trees base learners）

【ECML-PKDD 2019】基于bagged-trees学习的可解释生存梯度提升模型（Interpretable survival gradient boosting models with bagged trees base learners）

专知会员服务

6+阅读 · 2019年12月1日

【伯克利PNAS最新论文】可解释机器学习的定义、方法和应用（Definitions, methods, and applications in interpretable machine learning）,W. James Murdoch,Chandan Singh

【伯克利PNAS最新论文】可解释机器学习的定义、方法和应用（Definitions, methods, and applications in interpretable machine learning）,W. James Murdoch,Chandan Singh

专知会员服务

55+阅读 · 2019年11月20日

【ICCV 2019 Toturial】Interpretable Machine Learning for Computer Vision（用于计算机视觉的可解释性机器学习）

【ICCV 2019 Toturial】Interpretable Machine Learning for Computer Vision（用于计算机视觉的可解释性机器学习）

专知会员服务

32+阅读 · 2019年10月30日

Risk Sensitive Portfolio Optimization with Regime-Switching and Default Contagion，香港理工大学应用数学系余翔助理教授，第八届全国社会媒体处理大会SMP2019

Risk Sensitive Portfolio Optimization with Regime-Switching and Default Contagion，香港理工大学应用数学系余翔助理教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

10+阅读 · 2019年10月24日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《乌克兰无人机产业：志愿者与政策在构建新兴无人机产业中的协同作用》最新报告

《人工智能辅助决策中的数据可视化：系统性综述》

人工智能驱动弹药制造现代化：美国陆军转型之路

《敏捷作战部署中枢纽-辐条基地选址优化研究》80页

相关资讯

Successor representations 强化学习表示的生物学启发

Successor representations 强化学习表示的生物学启发

CreateAMind

6+阅读 · 2019年9月5日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

【Awesome】最全的机器学习可解释性资料（machine-learning-interpretability）

【Awesome】最全的机器学习可解释性资料（machine-learning-interpretability）

专知

29+阅读 · 2019年3月1日

CCF C类 | IJCNN 2019 Special Section : 信息论与深度学习

CCF C类 | IJCNN 2019 Special Section : 信息论与深度学习

Call4Papers

5+阅读 · 2018年12月7日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

【论文推荐】最新7篇视觉问答（VQA）相关论文—解释、读写记忆网络、逆视觉问答、视觉推理、可解释性、注意力机制、计数

【论文推荐】最新7篇视觉问答（VQA）相关论文—解释、读写记忆网络、逆视觉问答、视觉推理、可解释性、注意力机制、计数

专知

30+阅读 · 2018年3月22日

最新5篇生成对抗网络相关论文推荐—FusedGAN、DeblurGAN、AdvGAN、CipherGAN、MMD GANS

最新5篇生成对抗网络相关论文推荐—FusedGAN、DeblurGAN、AdvGAN、CipherGAN、MMD GANS

专知

23+阅读 · 2018年1月18日

计算机视觉近一年进展综述

计算机视觉近一年进展综述

机器学习研究会

9+阅读 · 2017年11月25日

深度学习医学图像分析文献集

深度学习医学图像分析文献集

机器学习研究会

19+阅读 · 2017年10月13日

可解释的CNN

可解释的CNN

CreateAMind

17+阅读 · 2017年10月5日

相关论文

Can Information Flows Suggest Targets for Interventions in Neural Circuits?

Arxiv

0+阅读 · 2021年11月9日

Robust walking based on MPC with viability guarantees

Arxiv

0+阅读 · 2021年11月9日

Counterfactual Explanations as Interventions in Latent Space

Arxiv

0+阅读 · 2021年11月8日

NeurInt : Learning to Interpolate through Neural ODEs

Arxiv

0+阅读 · 2021年11月7日

Interpretable, predictive spatio-temporal models via enhanced Pairwise Directions Estimation

Arxiv

0+阅读 · 2021年11月6日

Trustworthy AI: From Principles to Practices

Arxiv

46+阅读 · 2021年10月4日

Towards Rigorous Interpretations: a Formalisation of Feature Attribution

Arxiv

4+阅读 · 2021年4月26日

Interpretable machine learning: definitions, methods, and applications

Interpretable machine learning: definitions, methods, and applications

Arxiv

19+阅读 · 2019年1月14日

Learning with Interpretable Structure from RNN

Arxiv

19+阅读 · 2018年10月25日

Interpretable Active Learning

Interpretable Active Learning

Arxiv

3+阅读 · 2018年6月24日

微信扫码咨询专知VIP会员