Deep neural network (DNN) classifiers are often overconfident, producing miscalibrated class probabilities. In high-risk applications like healthcare, practitioners require $\textit{fully calibrated}$ probability predictions for decision-making. That is, conditioned on the prediction $\textit{vector}$, $\textit{every}$ class' probability should be close to the predicted value. Most existing calibration methods either lack theoretical guarantees for producing calibrated outputs, reduce classification accuracy in the process, or only calibrate the predicted class. This paper proposes a new Kernel-based calibration method called KCal. Unlike existing calibration procedures, KCal does not operate directly on the logits or softmax outputs of the DNN. Instead, KCal learns a metric space on the penultimate-layer latent embedding and generates predictions using kernel density estimates on a calibration set. We first analyze KCal theoretically, showing that it enjoys a provable $\textit{full}$ calibration guarantee. Then, through extensive experiments across a variety of datasets, we show that KCal consistently outperforms baselines as measured by the calibration error and by proper scoring rules like the Brier Score.
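As a minimal sketch of the prediction step described above, the following illustrates how class probabilities can be read off from a kernel density estimate over a labeled calibration set. All names here are illustrative, not the paper's implementation: the learned metric is stood in for by a fixed projection matrix A, and a Gaussian kernel with bandwidth h is assumed.

import numpy as np

def kde_class_probs(z, cal_embeddings, cal_labels, A, h=1.0, num_classes=10):
    """Predict a class-probability vector for one test embedding `z` using a
    Nadaraya-Watson-style kernel estimate over a held-out calibration set."""
    # Map the test point and the calibration embeddings into the metric space.
    z_proj = A @ z                      # shape (d',)
    cal_proj = cal_embeddings @ A.T     # shape (n, d')

    # Gaussian kernel weight for each calibration point, based on distance.
    sq_dists = np.sum((cal_proj - z_proj) ** 2, axis=1)
    weights = np.exp(-sq_dists / (2 * h ** 2))

    # Accumulate kernel mass per class, then normalize to a probability vector.
    probs = np.zeros(num_classes)
    np.add.at(probs, cal_labels, weights)
    return probs / probs.sum()

Because each probability is obtained by normalizing kernel mass accumulated over labeled calibration points rather than by rescaling logits, the output is a valid probability vector whose calibration quality hinges on the learned metric and bandwidth, which is precisely the component KCal trains.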