Counterfactual Evaluation for Explainable AI - 专知 (Zhuanzhi) Papers


September 5, 2021

Counterfactual Evaluation for Explainable AI


Yingqiang Ge, Shuchang Liu, Zelong Li, Shuyuan Xu, Shijie Geng, Yunqi Li, Juntao Tan, Fei Sun, Yongfeng Zhang

While recent years have witnessed the emergence of various explainable methods in machine learning, the degree to which explanations really represent the reasoning process behind the model prediction, namely the faithfulness of explanation, is still an open problem. One commonly used way to measure faithfulness is to apply erasure-based criteria. Though conceptually simple, erasure-based criteria can introduce biases and artifacts. We propose a new methodology to evaluate the faithfulness of explanations from the counterfactual reasoning perspective: the model should produce substantially different outputs for the original input and its corresponding counterfactual edited on a faithful feature. Specifically, we introduce two algorithms to find the proper counterfactuals in both discrete and continuous scenarios and then use the acquired counterfactuals to measure faithfulness. Empirical results on several datasets show that compared with existing metrics, our proposed counterfactual evaluation method can achieve top correlation with the ground truth under diffe
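The counterfactual criterion in the abstract can be illustrated with a small sketch for the continuous case. This is not the authors' algorithm: the toy logistic model, its weights, the perturbation budget, and the grid search over edits are all assumptions made for illustration. The idea shown is only that editing a feature the model truly relies on should change the output much more than editing a feature it ignores.

```python
import math


def model(x):
    """Toy logistic model standing in for the black box (hypothetical weights)."""
    weights = [2.0, 0.0, -0.5]  # feature 1 is ignored by construction
    z = sum(w * v for w, v in zip(weights, x))
    return 1.0 / (1.0 + math.exp(-z))


def counterfactual_change(f, x, feature, budget=1.0, steps=21):
    """Largest output change reachable by editing one feature within +/- budget.

    A plain grid search is used here as a stand-in for a real
    counterfactual search procedure (assumption).
    """
    base = f(x)
    best = 0.0
    for i in range(steps):
        delta = -budget + 2.0 * budget * i / (steps - 1)
        x_cf = list(x)          # copy so the original input is untouched
        x_cf[feature] += delta  # counterfactual: edit only this feature
        best = max(best, abs(f(x_cf) - base))
    return best


def faithfulness(f, x, explained_features, budget=1.0):
    """Mean counterfactual output change over the features an explanation selects."""
    changes = [counterfactual_change(f, x, j, budget) for j in explained_features]
    return sum(changes) / len(changes)


x = [0.5, 1.0, -1.0]
good = faithfulness(model, x, [0])  # explanation pointing at an influential feature
bad = faithfulness(model, x, [1])   # explanation pointing at an ignored feature
print(good > bad)
```

Under these assumptions, an explanation that names feature 0 scores higher than one that names feature 1, because no edit to feature 1 can move the toy model's output. A discrete variant would swap the grid of perturbations for a set of candidate replacement values.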

