执行可解释性及其统计影响:准确性和可解释性之间的取舍 (Enforcing Interpretability and its Statistical Impacts: Trade-offs between Accuracy and Interpretability) - 专知论文

会员服务 ·

0

统计量 · 模型评估 · 经验风险最小化 · 经验风险 · Performer ·

2020 年 10 月 26 日

Enforcing Interpretability and its Statistical Impacts: Trade-offs between Accuracy and Interpretability

翻译：执行可解释性及其统计影响:准确性和可解释性之间的取舍

Gintare Karolina Dziugaite,Shai Ben-David,Daniel M. Roy

from arxiv, 12 pages

To date, there has been no formal study of the statistical cost of interpretability in machine learning. As such, the discourse around potential trade-offs is often informal and misconceptions abound. In this work, we aim to initiate a formal study of these trade-offs. A seemingly insurmountable roadblock is the lack of any agreed upon definition of interpretability. Instead, we propose a shift in perspective. Rather than attempt to define interpretability, we propose to model the \emph{act} of \emph{enforcing} interpretability. As a starting point, we focus on the setting of empirical risk minimization for binary classification, and view interpretability as a constraint placed on learning. That is, we assume we are given a subset of hypothesis that are deemed to be interpretable, possibly depending on the data distribution and other aspects of the context. We then model the act of enforcing interpretability as that of performing empirical risk minimization over the set of interpretable hypotheses. This model allows us to reason about the statistical implications of enforcing interpretability, using known results in statistical learning theory. Focusing on accuracy, we perform a case analysis, explaining why one may or may not observe a trade-off between accuracy and interpretability when the restriction to interpretable classifiers does or does not come at the cost of some excess statistical risk. We close with some worked examples and some open problems, which we hope will spur further theoretical development around the tradeoffs involved in interpretability.

翻译：迄今为止,还没有正式研究机器学习解释的统计成本。因此,关于潜在权衡的论述往往是非正式的,误解也很多。在这项工作中,我们的目标是开始对这些权衡进行正式研究。一个看似不可逾越的障碍是缺乏任何关于解释定义的一致意见。相反,我们提议改变观点。我们提议,与其试图界定解释的可解释性,不如以模型为模型。作为一个起点,我们侧重于为二进制分类设定经验风险最小化,并将可解释性视为对学习的制约。在这项工作中,我们假定我们得到了一套假设,这些假设被认为是可以解释的,可能取决于数据分配和背景的其他方面。我们然后将执行解释性的行为作为执行对一套可解释的假设进行最小化的经验风险最小化的模型。这个模型让我们能够解释执行可解释性所涉的统计问题,使用已知的统计学习理论结果。侧重于准确性、我们进行案例分析,并解释某些可能无法解释的精确性,或者在统计风险发生时,我们无法精确性地解释一些解释。

0

相关内容

统计量

【快讯】ICML 2020论文出炉，1088篇上榜，你的paper中了吗？

【快讯】ICML 2020论文出炉，1088篇上榜，你的paper中了吗？

专知会员服务

52+阅读 · 2020年6月1日

【SIGIR2020】高效查询自动补全，Efficient and Effective Query Auto-Completion

【SIGIR2020】高效查询自动补全，Efficient and Effective Query Auto-Completion

专知会员服务

10+阅读 · 2020年5月14日

【CHI2020-微软】解释可解释性:理解数据科学家使用机器学习的可解释性工具，Interpreting Interpretability: Understanding Data Scientists’Use of Interpretability Tools for Machine Learning

【CHI2020-微软】解释可解释性:理解数据科学家使用机器学习的可解释性工具，Interpreting Interpretability: Understanding Data Scientists’Use of Interpretability Tools for Machine Learning

专知会员服务

55+阅读 · 2020年3月8日

《可解释的机器学习-interpretable-ml》238页pdf

《可解释的机器学习-interpretable-ml》238页pdf

专知会员服务

208+阅读 · 2020年2月24日

【MIT】图神经网络的泛化与表示极限，《Generalization and Representational Limits of Graph Neural Networks》

【MIT】图神经网络的泛化与表示极限，《Generalization and Representational Limits of Graph Neural Networks》

专知会员服务

46+阅读 · 2020年2月23日

【ICCV 2019 Toturial】Interpretable Machine Learning for Computer Vision（用于计算机视觉的可解释性机器学习）

【ICCV 2019 Toturial】Interpretable Machine Learning for Computer Vision（用于计算机视觉的可解释性机器学习）

专知会员服务

32+阅读 · 2019年10月30日

可解释机器学习（Interpretable Machine Learning）：打开黑盒之谜（238页书籍下载）

可解释机器学习（Interpretable Machine Learning）：打开黑盒之谜（238页书籍下载）

专知会员服务

152+阅读 · 2019年10月27日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【Awesome】最全的机器学习可解释性资料（machine-learning-interpretability）

【Awesome】最全的机器学习可解释性资料（machine-learning-interpretability）

专知

29+阅读 · 2019年3月1日

IEEE | DSC 2019诚邀稿件 (EI检索)

IEEE | DSC 2019诚邀稿件 (EI检索)

Call4Papers

10+阅读 · 2019年2月25日

人工智能 | SCI期刊专刊信息3条

人工智能 | SCI期刊专刊信息3条

Call4Papers

5+阅读 · 2019年1月10日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

Disentangled的假设的探讨

Disentangled的假设的探讨

CreateAMind

9+阅读 · 2018年12月10日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

机器学习大礼包 | 课程、数据集、面试题免费送！

机器学习大礼包 | 课程、数据集、面试题免费送！

九章算法

9+阅读 · 2018年2月16日

人工智能 | 国际会议/SCI期刊约稿信息9条

人工智能 | 国际会议/SCI期刊约稿信息9条

Call4Papers

3+阅读 · 2018年1月12日

Capsule Networks解析

Capsule Networks解析

机器学习研究会

11+阅读 · 2017年11月12日

【今日新增】IEEE Trans.专刊截稿信息8条

【今日新增】IEEE Trans.专刊截稿信息8条

Call4Papers

7+阅读 · 2017年6月29日

The Depth-to-Width Interplay in Self-Attention

The Depth-to-Width Interplay in Self-Attention

Arxiv

1+阅读 · 2020年12月9日

A Statistical Test for Probabilistic Fairness

Arxiv

0+阅读 · 2020年12月9日

Multi-Objective Interpolation Training for Robustness to Label Noise

Multi-Objective Interpolation Training for Robustness to Label Noise

Arxiv

0+阅读 · 2020年12月8日

A review of possible effects of cognitive biases on the interpretation of rule-based machine learning models

A review of possible effects of cognitive biases on the interpretation of rule-based machine learning models

Arxiv

0+阅读 · 2020年12月7日

Algebraic geometry of discrete interventional models

Arxiv

0+阅读 · 2020年12月7日

A Weighted Solution to SVM Actionability and Interpretability

Arxiv

0+阅读 · 2020年12月6日

Proactive Pseudo-Intervention: Causally Informed Contrastive Learning For Interpretable Vision Models

Arxiv

1+阅读 · 2020年12月6日

Learning Interpretable Concept-Based Models with Human Feedback

Arxiv

0+阅读 · 2020年12月4日

An Empirical Study on the Relation between Network Interpretability and Adversarial Robustness

Arxiv

0+阅读 · 2020年12月4日

Learning with Interpretable Structure from RNN

Arxiv

19+阅读 · 2018年10月25日

VIP会员

文章信息

相关主题

经验风险最小化

相关VIP内容

【快讯】ICML 2020论文出炉，1088篇上榜，你的paper中了吗？

【快讯】ICML 2020论文出炉，1088篇上榜，你的paper中了吗？

专知会员服务

52+阅读 · 2020年6月1日

【SIGIR2020】高效查询自动补全，Efficient and Effective Query Auto-Completion

【SIGIR2020】高效查询自动补全，Efficient and Effective Query Auto-Completion

专知会员服务

10+阅读 · 2020年5月14日

【CHI2020-微软】解释可解释性:理解数据科学家使用机器学习的可解释性工具，Interpreting Interpretability: Understanding Data Scientists’Use of Interpretability Tools for Machine Learning

【CHI2020-微软】解释可解释性:理解数据科学家使用机器学习的可解释性工具，Interpreting Interpretability: Understanding Data Scientists’Use of Interpretability Tools for Machine Learning

专知会员服务

55+阅读 · 2020年3月8日

《可解释的机器学习-interpretable-ml》238页pdf

《可解释的机器学习-interpretable-ml》238页pdf

专知会员服务

208+阅读 · 2020年2月24日

【MIT】图神经网络的泛化与表示极限，《Generalization and Representational Limits of Graph Neural Networks》

【MIT】图神经网络的泛化与表示极限，《Generalization and Representational Limits of Graph Neural Networks》

专知会员服务

46+阅读 · 2020年2月23日

【ICCV 2019 Toturial】Interpretable Machine Learning for Computer Vision（用于计算机视觉的可解释性机器学习）

【ICCV 2019 Toturial】Interpretable Machine Learning for Computer Vision（用于计算机视觉的可解释性机器学习）

专知会员服务

32+阅读 · 2019年10月30日

可解释机器学习（Interpretable Machine Learning）：打开黑盒之谜（238页书籍下载）

可解释机器学习（Interpretable Machine Learning）：打开黑盒之谜（238页书籍下载）

专知会员服务

152+阅读 · 2019年10月27日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

未来战场：AI赋能无人作战新范式，39页ppt

【牛津博士论文】无限维空间中的广义变分推断

DeepSeek AI 从入门到付费专家·第一卷：动手实践、真实应用与可扩展 AI 解决方案全掌握

2025中国AI Agent商业应用场景洞察研究

相关资讯

【Awesome】最全的机器学习可解释性资料（machine-learning-interpretability）

【Awesome】最全的机器学习可解释性资料（machine-learning-interpretability）

专知

29+阅读 · 2019年3月1日

IEEE | DSC 2019诚邀稿件 (EI检索)

IEEE | DSC 2019诚邀稿件 (EI检索)

Call4Papers

10+阅读 · 2019年2月25日

人工智能 | SCI期刊专刊信息3条

人工智能 | SCI期刊专刊信息3条

Call4Papers

5+阅读 · 2019年1月10日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

Disentangled的假设的探讨

Disentangled的假设的探讨

CreateAMind

9+阅读 · 2018年12月10日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

机器学习大礼包 | 课程、数据集、面试题免费送！

机器学习大礼包 | 课程、数据集、面试题免费送！

九章算法

9+阅读 · 2018年2月16日

人工智能 | 国际会议/SCI期刊约稿信息9条

人工智能 | 国际会议/SCI期刊约稿信息9条

Call4Papers

3+阅读 · 2018年1月12日

Capsule Networks解析

Capsule Networks解析

机器学习研究会

11+阅读 · 2017年11月12日

【今日新增】IEEE Trans.专刊截稿信息8条

【今日新增】IEEE Trans.专刊截稿信息8条

Call4Papers

7+阅读 · 2017年6月29日

相关论文

The Depth-to-Width Interplay in Self-Attention

The Depth-to-Width Interplay in Self-Attention

Arxiv

1+阅读 · 2020年12月9日

A Statistical Test for Probabilistic Fairness

Arxiv

0+阅读 · 2020年12月9日

Multi-Objective Interpolation Training for Robustness to Label Noise

Multi-Objective Interpolation Training for Robustness to Label Noise

Arxiv

0+阅读 · 2020年12月8日

A review of possible effects of cognitive biases on the interpretation of rule-based machine learning models

A review of possible effects of cognitive biases on the interpretation of rule-based machine learning models

Arxiv

0+阅读 · 2020年12月7日

Algebraic geometry of discrete interventional models

Arxiv

0+阅读 · 2020年12月7日

A Weighted Solution to SVM Actionability and Interpretability

Arxiv

0+阅读 · 2020年12月6日

Proactive Pseudo-Intervention: Causally Informed Contrastive Learning For Interpretable Vision Models

Arxiv

1+阅读 · 2020年12月6日

Learning Interpretable Concept-Based Models with Human Feedback

Arxiv

0+阅读 · 2020年12月4日

An Empirical Study on the Relation between Network Interpretability and Adversarial Robustness

Arxiv

0+阅读 · 2020年12月4日

Learning with Interpretable Structure from RNN

Arxiv

19+阅读 · 2018年10月25日

微信扫码咨询专知VIP会员