Accurate estimation of predictive uncertainty (model calibration) is essential for the safe application of neural networks. Many instances of miscalibration in modern neural networks have been reported, suggesting a trend that newer, more accurate models produce poorly calibrated predictions. Here, we revisit this question for recent state-of-the-art image classification models. We systematically relate model calibration and accuracy, and find that the most recent models, notably those not using convolutions, are among the best calibrated. Trends observed in prior model generations, such as decay of calibration with distribution shift or model size, are less pronounced in recent architectures. We also show that model size and amount of pretraining do not fully explain these differences, suggesting that architecture is a major determinant of calibration properties.
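The abstract does not name a specific metric, but calibration of classifiers is commonly quantified with the expected calibration error (ECE): predictions are binned by confidence, and the gap between average confidence and empirical accuracy is averaged across bins. A minimal sketch of this standard metric (function name and toy data are illustrative, not from the paper):

```python
import numpy as np

def expected_calibration_error(confidences, correct, n_bins=10):
    """Bin predictions by confidence and average |accuracy - confidence|
    per bin, weighted by the fraction of samples landing in each bin."""
    confidences = np.asarray(confidences, dtype=float)
    correct = np.asarray(correct, dtype=float)
    edges = np.linspace(0.0, 1.0, n_bins + 1)
    ece = 0.0
    for lo, hi in zip(edges[:-1], edges[1:]):
        mask = (confidences > lo) & (confidences <= hi)
        if mask.any():
            # Gap between how accurate the model is and how confident it claims to be.
            gap = abs(correct[mask].mean() - confidences[mask].mean())
            ece += mask.mean() * gap
    return ece

# Toy example: predictions made with 80% confidence that are right 80% of the time
# are perfectly calibrated, so the ECE is (numerically) zero.
conf = [0.8] * 10
hit = [1, 1, 1, 1, 1, 1, 1, 1, 0, 0]
print(expected_calibration_error(conf, hit))
```

A model can thus be highly accurate yet poorly calibrated (e.g., 99% confident but only 90% accurate), which is why the paper treats calibration and accuracy as separate axes.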