As neural network classifiers are deployed in real-world applications, it is crucial that their failures can be detected reliably. One practical solution is to assign confidence scores to each prediction and then use these scores to filter out possible misclassifications. However, existing confidence metrics are not yet sufficiently reliable for this role. This paper presents a new framework that produces a quantitative metric for detecting misclassification errors. This framework, RED, builds an error detector on top of the base classifier and estimates the uncertainty of the detection scores using Gaussian Processes. Experimental comparisons with other error detection methods on 125 UCI datasets demonstrate that this approach is effective. Further implementations on two probabilistic base classifiers and two large deep learning architectures in vision tasks confirm that the method is robust and scalable. Finally, an empirical analysis of RED with out-of-distribution and adversarial samples shows that the method can be used not only to detect errors but also to understand where they come from. RED can thereby be used to improve the trustworthiness of neural network classifiers more broadly in the future.
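To illustrate how such a framework could be organized, the following is a minimal sketch under assumed design choices: a base classifier's maximum-softmax confidence is corrected by a Gaussian Process regressor fit on held-out residuals (prediction correctness minus confidence), and the GP's predictive standard deviation serves as the uncertainty of the detection score. The residual target, the scikit-learn components, the kernel, and the flagging threshold are illustrative assumptions, not the paper's exact formulation.

# Sketch of a RED-style misclassification detector (illustrative assumptions,
# not the paper's exact design).
import numpy as np
from sklearn.neural_network import MLPClassifier
from sklearn.gaussian_process import GaussianProcessRegressor
from sklearn.gaussian_process.kernels import RBF, WhiteKernel
from sklearn.model_selection import train_test_split
from sklearn.datasets import load_digits

X, y = load_digits(return_X_y=True)
X_train, X_rest, y_train, y_rest = train_test_split(X, y, test_size=0.4, random_state=0)
X_val, X_test, y_val, y_test = train_test_split(X_rest, y_rest, test_size=0.5, random_state=0)

# 1. Train the base classifier.
base = MLPClassifier(hidden_layer_sizes=(64,), max_iter=300, random_state=0)
base.fit(X_train, y_train)

# 2. On held-out data, compute residuals between correctness and confidence.
conf_val = base.predict_proba(X_val).max(axis=1)            # base confidence score
correct_val = (base.predict(X_val) == y_val).astype(float)  # 1 if prediction is correct
residual_val = correct_val - conf_val                       # target for the error detector

# 3. Fit a Gaussian Process to predict the residual, with uncertainty.
gp = GaussianProcessRegressor(kernel=RBF() + WhiteKernel(), normalize_y=True)
gp.fit(X_val, residual_val)

# 4. Detection score: base confidence corrected by the predicted residual;
#    the GP standard deviation quantifies the uncertainty of that score.
conf_test = base.predict_proba(X_test).max(axis=1)
res_mean, res_std = gp.predict(X_test, return_std=True)
red_score = conf_test + res_mean

# Flag likely misclassifications: low corrected score or unusually high uncertainty
# (the 0.5 cutoff and 2-sigma rule are placeholder thresholds).
flagged = (red_score < 0.5) | (res_std > res_std.mean() + 2 * res_std.std())
print(f"Flagged {flagged.sum()} of {len(X_test)} test predictions as suspect.")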