This paper provides both an introduction to and a detailed overview of the principles and practice of classifier calibration. A well-calibrated classifier correctly quantifies the level of uncertainty or confidence associated with its instance-wise predictions. This is essential for critical applications, optimal decision making, cost-sensitive classification, and for some types of context change. Calibration research has a rich history which predates the birth of machine learning as an academic field by decades. However, a recent increase in the interest on calibration has led to new methods and the extension from binary to the multiclass setting. The space of options and issues to consider is large, and navigating it requires the right set of concepts and tools. We provide both introductory material and up-to-date technical details of the main concepts and methods, including proper scoring rules and other evaluation metrics, visualisation approaches, a comprehensive account of post-hoc calibration methods for binary and multiclass classification, and several advanced topics.
翻译:本文件介绍并详细概述了分类校准的原则和做法。一个经过充分校准的分类师正确地量化了与其实例预测相关的不确定性或信任度。这对于关键应用、最佳决策、成本敏感分类以及某些类型的背景变化至关重要。校准研究有着丰富的历史,在机器学习作为一个学术领域诞生几十年之前就已存在。然而,最近校准的兴趣增加导致采用了新方法,从二进制延伸到多级设置。要考虑的选项和问题空间是很大的,在浏览时需要一套正确的概念和工具。我们提供了主要概念和方法的介绍性材料和最新技术细节,包括适当的评分规则和其他评价指标、可视化方法、二进制和多级分类后校准方法的全面说明,以及若干先进的专题。