On Deep Neural Network Calibration by Regularization and its Impact on Refinement

Deep neural networks have been shown to be highly miscalibrated; they often tend to be overconfident in their predictions. This poses a significant challenge for safety-critical systems that need to use deep neural networks (DNNs) reliably. Many recently proposed approaches have made substantial progress in improving DNN calibration. However, they hardly touch upon refinement, which historically has been an essential aspect of calibration. Refinement indicates the separability of a network's correct and incorrect predictions. This paper presents a theoretically and empirically supported exposition reviewing the refinement of a calibrated model. First, we show the breakdown of expected calibration error (ECE) into predicted confidence and refinement under the assumption of overconfident predictions. Second, building on this result, we highlight that regularization-based calibration only focuses on naively reducing a model's confidence. This has a severe downside for a model's refinement, as correct and incorrect predictions become tightly coupled. Lastly, connecting refinement with ECE also provides support for existing refinement-based approaches, which improve calibration but do not explain the reasoning behind it. We support our claims through rigorous empirical evaluations of many state-of-the-art calibration approaches on widely used datasets and neural networks. We find that many calibration approaches, such as label smoothing and mixup, lower the usefulness of a DNN by degrading its refinement. Even under natural data shift, this calibration-refinement trade-off holds for the majority of calibration methods.
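The two quantities the abstract contrasts can be made concrete with a small sketch. This is not the paper's code: it assumes the common equal-width binned estimator for ECE, and it uses AUROC between correct and incorrect predictions as a stand-in measure of refinement, which is an assumption of this example. The function names and toy numbers are hypothetical.

```python
import numpy as np


def expected_calibration_error(confidences, correct, n_bins=10):
    """Equal-width binned ECE: sum over bins of (n_b / n) * |acc_b - conf_b|."""
    confidences = np.asarray(confidences, dtype=float)
    correct = np.asarray(correct, dtype=float)
    edges = np.linspace(0.0, 1.0, n_bins + 1)
    # Using only the interior edges maps every confidence to a bin in 0..n_bins-1.
    bin_ids = np.digitize(confidences, edges[1:-1], right=True)
    ece = 0.0
    for b in range(n_bins):
        mask = bin_ids == b
        if mask.any():
            gap = abs(correct[mask].mean() - confidences[mask].mean())
            ece += mask.mean() * gap  # mask.mean() is the fraction of samples in the bin
    return float(ece)


def refinement_auroc(confidences, correct):
    """Chance that a correct prediction outranks an incorrect one (ties count half).

    Assumption of this sketch: AUROC is used as a proxy for refinement.
    """
    confidences = np.asarray(confidences, dtype=float)
    correct = np.asarray(correct, dtype=bool)
    pos, neg = confidences[correct], confidences[~correct]
    if len(pos) == 0 or len(neg) == 0:
        return float("nan")
    diff = pos[:, None] - neg[None, :]
    return float((diff > 0).mean() + 0.5 * (diff == 0).mean())


# Hypothetical toy predictions: overconfident in every occupied bin.
conf = np.array([0.95, 0.92, 0.85, 0.83, 0.75, 0.72])
hit = np.array([1, 0, 1, 0, 1, 0])

ece = expected_calibration_error(conf, hit)
auroc = refinement_auroc(conf, hit)

# Extreme "confidence reduction": every prediction gets the accuracy as its confidence.
flat = np.full_like(conf, hit.mean())
ece_flat = expected_calibration_error(flat, hit)
auroc_flat = refinement_auroc(flat, hit)

print(f"original:  ECE={ece:.4f}  AUROC={auroc:.4f}")
print(f"flattened: ECE={ece_flat:.4f}  AUROC={auroc_flat:.4f}")
```

Because every occupied bin in the toy data is overconfident, the binned ECE collapses to mean confidence minus accuracy. The flattened variant is a deliberately extreme case: it drives ECE to zero while leaving correct and incorrect predictions indistinguishable by confidence, which is the kind of trade-off the abstract describes.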

