通过强力分析进行评估和解释的方法 (Evaluations and Methods for Explanation through Robustness Analysis) - 专知论文

会员服务 ·

0

稳健性 · contrastive · MoDELS · 情景 · 优化器 ·

2021 年 4 月 8 日

Evaluations and Methods for Explanation through Robustness Analysis

翻译：通过强力分析进行评估和解释的方法

Cheng-Yu Hsieh,Chih-Kuan Yeh,Xuanqing Liu,Pradeep Ravikumar,Seungyeon Kim,Sanjiv Kumar,Cho-Jui Hsieh

from arxiv, To appear in ICLR 2021

Feature based explanations, that provide importance of each feature towards the model prediction, is arguably one of the most intuitive ways to explain a model. In this paper, we establish a novel set of evaluation criteria for such feature based explanations by robustness analysis. In contrast to existing evaluations which require us to specify some way to "remove" features that could inevitably introduces biases and artifacts, we make use of the subtler notion of smaller adversarial perturbations. By optimizing towards our proposed evaluation criteria, we obtain new explanations that are loosely necessary and sufficient for a prediction. We further extend the explanation to extract the set of features that would move the current prediction to a target class by adopting targeted adversarial attack for the robustness analysis. Through experiments across multiple domains and a user study, we validate the usefulness of our evaluation criteria and our derived explanations.

翻译：以特征为基础的解释,对模型预测具有每个特征的重要性,可以说是解释模型的最直觉方法之一。在本文中,我们为这种基于特征的解释通过稳健性分析建立了一套新的评价标准。与要求我们指定某种“撤销”特征的方法以不可避免地引入偏见和人工制品的现有评价相比,我们利用较微妙的小型对抗性扰动概念。通过优化我们拟议的评价标准,我们获得新的解释,这些解释对于预测来说是不太必要和足够的。我们进一步扩展解释范围,通过采用有针对性的对抗性攻击进行稳健性分析,将目前的预测转移到目标类别。我们通过跨多个领域的试验和用户研究,验证我们的评价标准和衍生的解释的效用。

0

相关内容

稳健性

一图掌握《可解释人工智能XAI》操作指南

一图掌握《可解释人工智能XAI》操作指南

专知会员服务

60+阅读 · 2021年5月3日

NLP必读经典文献100篇

专知会员服务

124+阅读 · 2020年9月8日

【视频描述综述论文】Video Description: A Survey of Methods, Datasets, and Evaluation Metrics

【视频描述综述论文】Video Description: A Survey of Methods, Datasets, and Evaluation Metrics

专知会员服务

65+阅读 · 2020年5月12日

美国DARPA204页可解释人工智能文献综述论文《Explanation in Human-AI Systems》

专知会员服务

97+阅读 · 2020年1月9日

【综述】文献级机器翻译研究:方法与评价（A Survey on Document-level Machine Translation: Methods and Evaluation）

【综述】文献级机器翻译研究:方法与评价（A Survey on Document-level Machine Translation: Methods and Evaluation）

专知会员服务

7+阅读 · 2019年12月19日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

MIT新书《强化学习与最优控制》

MIT新书《强化学习与最优控制》

专知会员服务

280+阅读 · 2019年10月9日

最新BERT相关论文清单，BERT-related Papers

最新BERT相关论文清单，BERT-related Papers

专知会员服务

53+阅读 · 2019年9月29日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

IEEE | DSC 2019诚邀稿件 (EI检索)

IEEE | DSC 2019诚邀稿件 (EI检索)

Call4Papers

10+阅读 · 2019年2月25日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

Disentangled的假设的探讨

Disentangled的假设的探讨

CreateAMind

9+阅读 · 2018年12月10日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

已删除

将门创投

4+阅读 · 2018年7月31日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

计算机视觉近一年进展综述

计算机视觉近一年进展综述

机器学习研究会

9+阅读 · 2017年11月25日

可解释的CNN

可解释的CNN

CreateAMind

17+阅读 · 2017年10月5日

Sample Selection Bias in Evaluation of Prediction Performance of Causal Models

Sample Selection Bias in Evaluation of Prediction Performance of Causal Models

Arxiv

0+阅读 · 2021年6月3日

Counterfactual Explanation with Multi-Agent Reinforcement Learning for Drug Target Prediction

Arxiv

0+阅读 · 2021年6月2日

Explaining Data-Driven Decisions made by AI Systems: The Counterfactual Approach

Arxiv

0+阅读 · 2021年6月1日

To trust or not to trust an explanation: using LEAF to evaluate local linear XAI methods

Arxiv

0+阅读 · 2021年6月1日

Model-based Adversarial Meta-Reinforcement Learning

Arxiv

5+阅读 · 2020年6月16日

Pretrained Transformers Improve Out-of-Distribution Robustness

Arxiv

5+阅读 · 2020年4月13日

Meta-World: A Benchmark and Evaluation for Multi-Task and Meta Reinforcement Learning

Meta-World: A Benchmark and Evaluation for Multi-Task and Meta Reinforcement Learning

Arxiv

34+阅读 · 2019年10月24日

Towards Automated Machine Learning: Evaluation and Comparison of AutoML Approaches and Tools

Towards Automated Machine Learning: Evaluation and Comparison of AutoML Approaches and Tools

Arxiv

3+阅读 · 2019年9月3日

Predicting ConceptNet Path Quality Using Crowdsourced Assessments of Naturalness

Arxiv

3+阅读 · 2019年2月21日

A Framework for Evaluating 6-DOF Object Trackers

Arxiv

6+阅读 · 2018年3月28日

VIP会员

文章信息

相关主题

相关VIP内容

一图掌握《可解释人工智能XAI》操作指南

一图掌握《可解释人工智能XAI》操作指南

专知会员服务

60+阅读 · 2021年5月3日

NLP必读经典文献100篇

专知会员服务

124+阅读 · 2020年9月8日

【视频描述综述论文】Video Description: A Survey of Methods, Datasets, and Evaluation Metrics

【视频描述综述论文】Video Description: A Survey of Methods, Datasets, and Evaluation Metrics

专知会员服务

65+阅读 · 2020年5月12日

美国DARPA204页可解释人工智能文献综述论文《Explanation in Human-AI Systems》

专知会员服务

97+阅读 · 2020年1月9日

【综述】文献级机器翻译研究:方法与评价（A Survey on Document-level Machine Translation: Methods and Evaluation）

【综述】文献级机器翻译研究:方法与评价（A Survey on Document-level Machine Translation: Methods and Evaluation）

专知会员服务

7+阅读 · 2019年12月19日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

MIT新书《强化学习与最优控制》

MIT新书《强化学习与最优控制》

专知会员服务

280+阅读 · 2019年10月9日

最新BERT相关论文清单，BERT-related Papers

最新BERT相关论文清单，BERT-related Papers

专知会员服务

53+阅读 · 2019年9月29日

热门VIP内容

开通专知VIP会员享更多权益服务

新质生成式AI赋能产业变革的实践与路径

用于多模态大模型的离散标记化：全面综述

Nature综述：金融网络中的物理学

【CMU博士论文】通信高效且差分隐私的优化方法

相关资讯

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

IEEE | DSC 2019诚邀稿件 (EI检索)

IEEE | DSC 2019诚邀稿件 (EI检索)

Call4Papers

10+阅读 · 2019年2月25日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

Disentangled的假设的探讨

Disentangled的假设的探讨

CreateAMind

9+阅读 · 2018年12月10日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

已删除

将门创投

4+阅读 · 2018年7月31日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

计算机视觉近一年进展综述

计算机视觉近一年进展综述

机器学习研究会

9+阅读 · 2017年11月25日

可解释的CNN

可解释的CNN

CreateAMind

17+阅读 · 2017年10月5日

相关论文

Sample Selection Bias in Evaluation of Prediction Performance of Causal Models

Sample Selection Bias in Evaluation of Prediction Performance of Causal Models

Arxiv

0+阅读 · 2021年6月3日

Counterfactual Explanation with Multi-Agent Reinforcement Learning for Drug Target Prediction

Arxiv

0+阅读 · 2021年6月2日

Explaining Data-Driven Decisions made by AI Systems: The Counterfactual Approach

Arxiv

0+阅读 · 2021年6月1日

To trust or not to trust an explanation: using LEAF to evaluate local linear XAI methods

Arxiv

0+阅读 · 2021年6月1日

Model-based Adversarial Meta-Reinforcement Learning

Arxiv

5+阅读 · 2020年6月16日

Pretrained Transformers Improve Out-of-Distribution Robustness

Arxiv

5+阅读 · 2020年4月13日

Meta-World: A Benchmark and Evaluation for Multi-Task and Meta Reinforcement Learning

Meta-World: A Benchmark and Evaluation for Multi-Task and Meta Reinforcement Learning

Arxiv

34+阅读 · 2019年10月24日

Towards Automated Machine Learning: Evaluation and Comparison of AutoML Approaches and Tools

Towards Automated Machine Learning: Evaluation and Comparison of AutoML Approaches and Tools

Arxiv

3+阅读 · 2019年9月3日

Predicting ConceptNet Path Quality Using Crowdsourced Assessments of Naturalness

Arxiv

3+阅读 · 2019年2月21日

A Framework for Evaluating 6-DOF Object Trackers

Arxiv

6+阅读 · 2018年3月28日

微信扫码咨询专知VIP会员