评价大赦国际解释方法的计量方法 -- -- 图像分类任务中的有线电视新闻网 (Reference-based and No-reference Metrics to Evaluate Explanation Methods of AI -- CNNs in Image Classification Tasks) - 专知论文

会员服务 ·

0

图片分类 · 相关系数 · 真实值 · TOOLS · AI ·

2023 年 1 月 10 日

Reference-based and No-reference Metrics to Evaluate Explanation Methods of AI -- CNNs in Image Classification Tasks

翻译：评价大赦国际解释方法的计量方法 -- -- 图像分类任务中的有线电视新闻网

A. Zhukov,J. Benois-Pineau,R. Giot

from arxiv, Typos corrected, Introduction corrected, Experimental protocol corrected, Results corrected; 25 pages, 16 tables, 16 figures; Submitted to "Advances in Artificial Intelligence and Machine Learning" (ISSN: 2582-9793)

The most popular methods in AI-machine learning paradigm are mainly black boxes. This is why explanation of AI decisions is of emergency. Although dedicated explanation tools have been massively developed, the evaluation of their quality remains an open research question. In this paper, we generalize the methodologies of evaluation of post-hoc explainers of CNNs' decisions in visual classification tasks with reference and no-reference based metrics. We apply them on our previously developed explainers (FEM, MLFEM), and popular Grad-CAM. The reference-based metrics are Pearson correlation coefficient and Similarity computed between the explanation map and its ground truth represented by a Gaze Fixation Density Map obtained with a psycho-visual experiment. As a no-reference metric, we use stability metric, proposed by Alvarez-Melis and Jaakkola. We study its behaviour, consensus with reference-based metrics and show that in case of several kinds of degradation on input images, this metric is in agreement with reference-based ones. Therefore, it can be used for evaluation of the quality of explainers when the ground truth is not available.

翻译：AI-Mach学习模式中最受欢迎的方法主要是黑盒。这就是为什么解释AI决定是紧急的。尽管专门的解释工具已经大规模开发,但其质量评价仍然是一个开放的研究问题。在本文中,我们以参考和无参考基准的衡量标准,推广了CNN决定视觉分类任务后热解解释器的评价方法。我们将其应用于我们以前开发的解释器(FEM、MLFEM)和广受欢迎的 Grad-CAM。基于参考的衡量标准是Pearson相关系数和以心理-视觉实验获得的Gaze 固定密度地图所显示的解释地图及其地面真相之间的相似性。作为一个不参考指标,我们使用Alvarez-Melis和Jaakkola提出的稳定性指标。我们研究其行为,与基于参考的衡量标准达成共识,并表明在输入图像出现几种退化的情况下,该指标与基于参考的数据一致。因此,在无法获得地面真相时,可以用来评价解释者的质量。

0

相关内容

图片分类

图像分类，顾名思义，是一个输入图像，输出对该图像内容分类的描述的问题。它是计算机视觉的核心，实际应用广泛。

宾夕法尼亚大学最新《不确定性估计》课程笔记，134页pdf，附Slides

宾夕法尼亚大学最新《不确定性估计》课程笔记，134页pdf，附Slides

专知会员服务

49+阅读 · 2022年11月13日

2020数据工程师成长路线图

专知会员服务

41+阅读 · 2020年9月6日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

史上最全！358篇机器学习&自然语言处理综述论文！都这儿了

专知会员服务

129+阅读 · 2020年7月18日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Tutorial

【ICIG2021】Latest News & Announcements of the Tutorial

中国图象图形学学会CSIG

3+阅读 · 2021年12月20日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

中国图象图形学学会CSIG

0+阅读 · 2021年11月16日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium5

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium5

中国图象图形学学会CSIG

1+阅读 · 2021年11月11日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

中国图象图形学学会CSIG

0+阅读 · 2021年11月10日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【论文推荐】最新七篇图像分割相关论文—Attention U-Net、对抗结构匹配损失、卷积CRFs、对抗样本、弱监督分割

【论文推荐】最新七篇图像分割相关论文—Attention U-Net、对抗结构匹配损失、卷积CRFs、对抗样本、弱监督分割

专知

19+阅读 · 2018年5月31日

【推荐】GAN架构入门综述(资源汇总)

【推荐】GAN架构入门综述(资源汇总)

机器学习研究会

10+阅读 · 2017年9月3日

介孔复合微纳结构CaTi2O5的可控制备及光催化性能研究

国家自然科学基金

0+阅读 · 2015年12月31日

偕二氟取代Combretastatins衍生物的设计与合成

国家自然科学基金

0+阅读 · 2014年12月31日

Anderson型多酸的不对称修饰及可控组装研究

国家自然科学基金

1+阅读 · 2014年12月31日

碳/碳复合材料热梯度水热碳化沉积机理及结构调控研究

国家自然科学基金

0+阅读 · 2013年12月31日

A4Zr3O12陶瓷材料的高温辐照损伤机理研究

国家自然科学基金

0+阅读 · 2012年12月31日

稀土配合物修饰石墨烯量子点复合发光材料的合成及光电性能研究

国家自然科学基金

0+阅读 · 2012年12月31日

界面类型对多层膜材料耐辐照损伤能力的影响及其机制

国家自然科学基金

0+阅读 · 2012年12月31日

新型轻质高温γ1 +γ双相TiAl-Nb金属间化合物的强韧化机制

国家自然科学基金

0+阅读 · 2011年12月31日

基于超分子手性的非心对称配体配位聚集体的设计组装、结构调控与性能研究

国家自然科学基金

0+阅读 · 2011年12月31日

碳化硅基陶瓷材料高温相平衡研究

国家自然科学基金

0+阅读 · 2009年12月31日

Diminishing Return of Value Expansion Methods in Model-Based Reinforcement Learning

Arxiv

0+阅读 · 2023年3月7日

User Evaluation of Culture-to-Culture Image Translation with Generative Adversarial Nets

Arxiv

0+阅读 · 2023年3月6日

Evaluation of Interpretability Methods and Perturbation Artifacts in Deep Neural Networks

Arxiv

0+阅读 · 2023年3月6日

Motion-based extrinsic sensor-to-sensor calibration: Effect of reference frame selection for new and existing methods

Arxiv

0+阅读 · 2023年3月6日

Need for Objective Task-based Evaluation of Deep Learning-Based Denoising Methods: A Study in the Context of Myocardial Perfusion SPECT

Arxiv

0+阅读 · 2023年3月6日

Detecting Differences Between Correlation-Matrix Populations due to Single-variable Perturbations, with Application to Resting State fMRI

Arxiv

0+阅读 · 2023年3月3日

On The Coherence of Quantitative Evaluation of Visual Explanations

Arxiv

1+阅读 · 2023年3月3日

A Survey on Graph Counterfactual Explanations: Definitions, Methods, Evaluation

Arxiv

12+阅读 · 2022年10月21日

A continual learning survey: Defying forgetting in classification tasks

Arxiv

32+阅读 · 2021年4月16日

Self-Supervised Learning For Few-Shot Image Classification

Self-Supervised Learning For Few-Shot Image Classification

Arxiv

19+阅读 · 2019年11月14日

VIP会员

文章信息

相关主题

相关VIP内容

宾夕法尼亚大学最新《不确定性估计》课程笔记，134页pdf，附Slides

宾夕法尼亚大学最新《不确定性估计》课程笔记，134页pdf，附Slides

专知会员服务

49+阅读 · 2022年11月13日

2020数据工程师成长路线图

专知会员服务

41+阅读 · 2020年9月6日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

史上最全！358篇机器学习&自然语言处理综述论文！都这儿了

专知会员服务

129+阅读 · 2020年7月18日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

热门VIP内容

开通专知VIP会员享更多权益服务

机器人领域中最佳的三维场景表示是什么？——从几何表示到基础模型

《多域作战兵棋推演：运用形态学分析与人工智能加强国防人员训练》

【博士论文】快速高效的归一化流及其在图像生成模型中的应用

仿生机器人技术的军事应用

相关资讯

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Tutorial

【ICIG2021】Latest News & Announcements of the Tutorial

中国图象图形学学会CSIG

3+阅读 · 2021年12月20日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium8

中国图象图形学学会CSIG

0+阅读 · 2021年11月16日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium5

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium5

中国图象图形学学会CSIG

1+阅读 · 2021年11月11日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

中国图象图形学学会CSIG

0+阅读 · 2021年11月10日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【论文推荐】最新七篇图像分割相关论文—Attention U-Net、对抗结构匹配损失、卷积CRFs、对抗样本、弱监督分割

【论文推荐】最新七篇图像分割相关论文—Attention U-Net、对抗结构匹配损失、卷积CRFs、对抗样本、弱监督分割

专知

19+阅读 · 2018年5月31日

【推荐】GAN架构入门综述(资源汇总)

【推荐】GAN架构入门综述(资源汇总)

机器学习研究会

10+阅读 · 2017年9月3日

相关论文

Diminishing Return of Value Expansion Methods in Model-Based Reinforcement Learning

Arxiv

0+阅读 · 2023年3月7日

User Evaluation of Culture-to-Culture Image Translation with Generative Adversarial Nets

Arxiv

0+阅读 · 2023年3月6日

Evaluation of Interpretability Methods and Perturbation Artifacts in Deep Neural Networks

Arxiv

0+阅读 · 2023年3月6日

Motion-based extrinsic sensor-to-sensor calibration: Effect of reference frame selection for new and existing methods

Arxiv

0+阅读 · 2023年3月6日

Need for Objective Task-based Evaluation of Deep Learning-Based Denoising Methods: A Study in the Context of Myocardial Perfusion SPECT

Arxiv

0+阅读 · 2023年3月6日

Detecting Differences Between Correlation-Matrix Populations due to Single-variable Perturbations, with Application to Resting State fMRI

Arxiv

0+阅读 · 2023年3月3日

On The Coherence of Quantitative Evaluation of Visual Explanations

Arxiv

1+阅读 · 2023年3月3日

A Survey on Graph Counterfactual Explanations: Definitions, Methods, Evaluation

Arxiv

12+阅读 · 2022年10月21日

A continual learning survey: Defying forgetting in classification tasks

Arxiv

32+阅读 · 2021年4月16日

Self-Supervised Learning For Few-Shot Image Classification

Self-Supervised Learning For Few-Shot Image Classification

Arxiv

19+阅读 · 2019年11月14日

相关基金

介孔复合微纳结构CaTi2O5的可控制备及光催化性能研究

国家自然科学基金

0+阅读 · 2015年12月31日

偕二氟取代Combretastatins衍生物的设计与合成

国家自然科学基金

0+阅读 · 2014年12月31日

Anderson型多酸的不对称修饰及可控组装研究

国家自然科学基金

1+阅读 · 2014年12月31日

碳/碳复合材料热梯度水热碳化沉积机理及结构调控研究

国家自然科学基金

0+阅读 · 2013年12月31日

A4Zr3O12陶瓷材料的高温辐照损伤机理研究

国家自然科学基金

0+阅读 · 2012年12月31日

稀土配合物修饰石墨烯量子点复合发光材料的合成及光电性能研究

国家自然科学基金

0+阅读 · 2012年12月31日

界面类型对多层膜材料耐辐照损伤能力的影响及其机制

国家自然科学基金

0+阅读 · 2012年12月31日

新型轻质高温γ1 +γ双相TiAl-Nb金属间化合物的强韧化机制

国家自然科学基金

0+阅读 · 2011年12月31日

基于超分子手性的非心对称配体配位聚集体的设计组装、结构调控与性能研究

国家自然科学基金

0+阅读 · 2011年12月31日

碳化硅基陶瓷材料高温相平衡研究

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员