Deep neural networks (DNN) are prone to miscalibrated predictions, often exhibiting a mismatch between the predicted output and the associated confidence scores. Contemporary model calibration techniques mitigate the problem of overconfident predictions by pushing down the confidence of the winning class while increasing the confidence of the remaining classes across all test samples. However, from a deployment perspective, an ideal model is desired to (i) generate well-calibrated predictions for high-confidence samples with predicted probability say >0.95, and (ii) generate a higher proportion of legitimate high-confidence samples. To this end, we propose a novel regularization technique that can be used with classification losses, leading to state-of-the-art calibrated predictions at test time; From a deployment standpoint in safety-critical applications, only high-confidence samples from a well-calibrated model are of interest, as the remaining samples have to undergo manual inspection. Predictive confidence reduction of these potentially ``high-confidence samples'' is a downside of existing calibration approaches. We mitigate this by proposing a dynamic train-time data pruning strategy that prunes low-confidence samples every few epochs, providing an increase in "confident yet calibrated samples". We demonstrate state-of-the-art calibration performance across image classification benchmarks, reducing training time without much compromise in accuracy. We provide insights into why our dynamic pruning strategy that prunes low-confidence training samples leads to an increase in high-confidence samples at test time.
翻译:深心神经网络(DNN)容易被错误校准预测,常常显示预测产出与相关信任分数之间的不匹配。当代模型校准技术通过降低中产阶级的信心,同时提高所有测试样品中其余阶级的信心,减轻了过度自信预测的问题。然而,从部署角度而言,理想模型希望(一) 为预测概率为>0.95的高度自信样本产生经适当校准的预测,以及(二) 产生更高比例的合法高自信样本。为此,我们提出一种新的正规化技术,可用于分类损失,导致在测试时间作出最先进的校准预测;从安全临界应用中的部署角度来看,只有从一个校准模型中得出的高自信样本才有意义,因为其余样本需要进行人工检查。这些可能具有高度自信的样本的预测性降低信心是现有校准方法的下行点。我们通过提出动态的火车时间校准策略来减轻这一风险,从而在测试时得出最先进的预测,从而导致在测试时间上的精确度测试中,在每次测试中,我们能够对每个州级的样本进行大幅校准测试,从而测量“在精确度标定“我们”的精确度测试中,我们如何在精确地标中提高各种图像标标标标标标标中,从而在每一阶段的标定标定标定标定标准中,我们为“在不同的标定的每个标定标定标定标定的进度中,在每一标定标定标定的每个标定的每个标定的进度中,我们。”