特征归属方法的有效性及其与自动评价分数的相关性 (The effectiveness of feature attribution methods and its correlation with automatic evaluation scores) - 专知论文

会员服务 ·

0

相关系数 · Performer · 得分 · TOOLS · TEAM ·

2022 年 1 月 27 日

The effectiveness of feature attribution methods and its correlation with automatic evaluation scores

翻译：特征归属方法的有效性及其与自动评价分数的相关性

Giang Nguyen,Daeyoung Kim,Anh Nguyen

from arxiv, NeurIPS 2021; 10 pages of Main text; 28 pages of Appendix

Explaining the decisions of an Artificial Intelligence (AI) model is increasingly critical in many real-world, high-stake applications. Hundreds of papers have either proposed new feature attribution methods, discussed or harnessed these tools in their work. However, despite humans being the target end-users, most attribution methods were only evaluated on proxy automatic-evaluation metrics (Zhang et al. 2018; Zhou et al. 2016; Petsiuk et al. 2018). In this paper, we conduct the first user study to measure attribution map effectiveness in assisting humans in ImageNet classification and Stanford Dogs fine-grained classification, and when an image is natural or adversarial (i.e., contains adversarial perturbations). Overall, feature attribution is surprisingly not more effective than showing humans nearest training-set examples. On a harder task of fine-grained dog categorization, presenting attribution maps to humans does not help, but instead hurts the performance of human-AI teams compared to AI alone. Importantly, we found automatic attribution-map evaluation measures to correlate poorly with the actual human-AI team performance. Our findings encourage the community to rigorously test their methods on the downstream human-in-the-loop applications and to rethink the existing evaluation metrics.

翻译：解释人工智能模式决定在许多现实世界、高比例的应用中越来越重要。数百篇论文要么提出了新的特征归属方法,讨论或在其工作中利用了这些工具。然而,尽管人是目标终端用户,但大多数属性方法仅根据代理自动评估指标进行评估(张等人,2018年;周等人,2016年;佩西乌克等人,2018年)。在本文中,我们进行了第一次用户研究,以衡量在图像网络分类和斯坦福狗类精细分类中帮助人类的归属地图效力,以及当图像是自然或对抗性的(即含有对抗性干扰)。总体而言,特征归属并不比展示人类最近的训练范例更为有效。在细微的狗类分类这一更艰巨的任务中,向人类提供归属图无助于,反而伤害了人类个体团队的业绩。重要的是,我们发现自动属性映射评价措施与实际人类-AI团队业绩的相关性差强。我们的调查结果鼓励社区严格地测试其下游评估方法。我们鼓励社区在下游评估中严格地检验其标准应用。

0

相关内容

相关系数

【视频描述综述论文】Video Description: A Survey of Methods, Datasets, and Evaluation Metrics

【视频描述综述论文】Video Description: A Survey of Methods, Datasets, and Evaluation Metrics

专知会员服务

65+阅读 · 2020年5月12日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

163+阅读 · 2019年10月12日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

中国图象图形学学会CSIG

0+阅读 · 2021年12月17日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

中国图象图形学学会CSIG

0+阅读 · 2021年11月16日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium5

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium5

中国图象图形学学会CSIG

1+阅读 · 2021年11月11日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

中国图象图形学学会CSIG

0+阅读 · 2021年11月9日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

中国图象图形学学会CSIG

0+阅读 · 2021年11月8日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

可积系统的代数与几何结构

国家自然科学基金

0+阅读 · 2013年12月31日

玉米抗甘蔗花叶病毒病基因Scmv1的功能和抗病机理

国家自然科学基金

0+阅读 · 2012年12月31日

结构物入水冲击动力学问题的DSPH计算方法和试验研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于表达残差稀疏性的遮挡人脸识别方法研究

国家自然科学基金

0+阅读 · 2012年12月31日

大肠杆菌K1外膜蛋白A特异结构在其导致新生儿细菌性脑膜炎中的作用研究

国家自然科学基金

0+阅读 · 2012年12月31日

面向GPU的体系结构敏感型数值算法优化技术研究

国家自然科学基金

1+阅读 · 2012年12月31日

高维数据特征选择的稳定性研究

国家自然科学基金

0+阅读 · 2012年12月31日

对称高密度紧小波框架及其在机械故障诊断中的应用研究

国家自然科学基金

0+阅读 · 2009年12月31日

克里佛德代数结构框架下高维空间中若干问题的研究

国家自然科学基金

0+阅读 · 2009年12月31日

高通量基因数据分析中的 Bayes 统计方法

国家自然科学基金

1+阅读 · 2008年12月31日

Faithful or Extractive? On Mitigating the Faithfulness-Abstractiveness Trade-off in Abstractive Summarization

Arxiv

0+阅读 · 2022年4月20日

Test suite effectiveness metric evaluation: what do we know and what should we do?

Arxiv

0+阅读 · 2022年4月19日

UID2021: An Underwater Image Dataset for Evaluation of No-reference Quality Assessment Metrics

Arxiv

0+阅读 · 2022年4月19日

The Importance of Landscape Features for Performance Prediction of Modular CMA-ES Variants

The Importance of Landscape Features for Performance Prediction of Modular CMA-ES Variants

Arxiv

0+阅读 · 2022年4月15日

Automated Test-Case Generation for Solidity Smart Contracts: the AGSolT Approach and its Evaluation

Arxiv

0+阅读 · 2022年4月15日

Do Feature Attribution Methods Correctly Attribute Features?

Arxiv

15+阅读 · 2021年12月15日

Fine-Grained Neural Network Explanation by Identifying Input Features with Predictive Information

Arxiv

10+阅读 · 2021年10月4日

Adaptive Methods for Real-World Domain Generalization

Arxiv

13+阅读 · 2021年3月29日

Text Detection and Recognition in the Wild: A Review

Arxiv

20+阅读 · 2020年6月8日

The Unreasonable Effectiveness of Deep Features as a Perceptual Metric

Arxiv

11+阅读 · 2018年1月11日

VIP会员

文章信息

相关主题

相关VIP内容

【视频描述综述论文】Video Description: A Survey of Methods, Datasets, and Evaluation Metrics

【视频描述综述论文】Video Description: A Survey of Methods, Datasets, and Evaluation Metrics

专知会员服务

65+阅读 · 2020年5月12日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

163+阅读 · 2019年10月12日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《解析陆域作战方向：一个概念性框架》报告

《人工智能与人类的未来》2025年最新300页书籍

追寻真正的AI自主性：从遗留思维到战场优势

《“蛛网”行动：乌克兰不对称作战的演进》报告

相关资讯

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium9

中国图象图形学学会CSIG

0+阅读 · 2021年12月17日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

中国图象图形学学会CSIG

0+阅读 · 2021年11月16日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium5

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium5

中国图象图形学学会CSIG

1+阅读 · 2021年11月11日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium3

中国图象图形学学会CSIG

0+阅读 · 2021年11月9日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

中国图象图形学学会CSIG

0+阅读 · 2021年11月8日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

相关论文

Faithful or Extractive? On Mitigating the Faithfulness-Abstractiveness Trade-off in Abstractive Summarization

Arxiv

0+阅读 · 2022年4月20日

Test suite effectiveness metric evaluation: what do we know and what should we do?

Arxiv

0+阅读 · 2022年4月19日

UID2021: An Underwater Image Dataset for Evaluation of No-reference Quality Assessment Metrics

Arxiv

0+阅读 · 2022年4月19日

The Importance of Landscape Features for Performance Prediction of Modular CMA-ES Variants

The Importance of Landscape Features for Performance Prediction of Modular CMA-ES Variants

Arxiv

0+阅读 · 2022年4月15日

Automated Test-Case Generation for Solidity Smart Contracts: the AGSolT Approach and its Evaluation

Arxiv

0+阅读 · 2022年4月15日

Do Feature Attribution Methods Correctly Attribute Features?

Arxiv

15+阅读 · 2021年12月15日

Fine-Grained Neural Network Explanation by Identifying Input Features with Predictive Information

Arxiv

10+阅读 · 2021年10月4日

Adaptive Methods for Real-World Domain Generalization

Arxiv

13+阅读 · 2021年3月29日

Text Detection and Recognition in the Wild: A Review

Arxiv

20+阅读 · 2020年6月8日

The Unreasonable Effectiveness of Deep Features as a Perceptual Metric

Arxiv

11+阅读 · 2018年1月11日

相关基金

可积系统的代数与几何结构

国家自然科学基金

0+阅读 · 2013年12月31日

玉米抗甘蔗花叶病毒病基因Scmv1的功能和抗病机理

国家自然科学基金

0+阅读 · 2012年12月31日

结构物入水冲击动力学问题的DSPH计算方法和试验研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于表达残差稀疏性的遮挡人脸识别方法研究

国家自然科学基金

0+阅读 · 2012年12月31日

大肠杆菌K1外膜蛋白A特异结构在其导致新生儿细菌性脑膜炎中的作用研究

国家自然科学基金

0+阅读 · 2012年12月31日

面向GPU的体系结构敏感型数值算法优化技术研究

国家自然科学基金

1+阅读 · 2012年12月31日

高维数据特征选择的稳定性研究

国家自然科学基金

0+阅读 · 2012年12月31日

对称高密度紧小波框架及其在机械故障诊断中的应用研究

国家自然科学基金

0+阅读 · 2009年12月31日

克里佛德代数结构框架下高维空间中若干问题的研究

国家自然科学基金

0+阅读 · 2009年12月31日

高通量基因数据分析中的 Bayes 统计方法

国家自然科学基金

1+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员