Most machine learning classifiers only concern classification accuracy, while certain applications (such as medical diagnosis, meteorological forecasting, and computation advertising) require the model to predict the true probability, known as a calibrated estimate. In previous work, researchers have developed several calibration methods to post-process the outputs of a predictor to obtain calibrated values, such as binning and scaling methods. Compared with scaling, binning methods are shown to have distribution-free theoretical guarantees, which motivates us to prefer binning methods for calibration. However, we notice that existing binning methods have several drawbacks: (a) the binning scheme only considers the original prediction values, thus limiting the calibration performance; and (b) the binning approach is non-individual, mapping multiple samples in a bin to the same value, and thus is not suitable for order-sensitive applications. In this paper, we propose a feature-aware binning framework, called Multiple Boosting Calibration Trees (MBCT), along with a multi-view calibration loss to tackle the above issues. Our MBCT optimizes the binning scheme by the tree structures of features, and adopts a linear function in a tree node to achieve individual calibration. Our MBCT is non-monotonic, and has the potential to improve order accuracy, due to its learnable binning scheme and the individual calibration. We conduct comprehensive experiments on three datasets in different fields. Results show that our method outperforms all competing models in terms of both calibration error and order accuracy. We also conduct simulation experiments, justifying that the proposed multi-view calibration loss is a better metric in modeling calibration error.
翻译:大多数机器学习分类器仅涉及分类准确性,而某些应用(如医学诊断、气象预报和计算广告)要求模型预测真实概率,称为校准估计。在以往的工作中,研究人员开发了几种校准方法,用于处理预测器输出后处理校准值,如宾式和缩放方法。与比例缩放相比,宾式方法显示为无分配性理论保证,这促使我们偏爱校准的宾式方法。然而,我们注意到,现有的宾式方法有几个缺点:(a) 宾式方法只考虑最初的预测值,从而限制多校准性能;以及(b) 宾式方法是非个体性的,将多个样本绘制成同一个数值,因此不适合对定序敏感的应用。在本文中,我们建议了一个不使用多级调校准校正树的仪表框架(MBCT), 以及我们提出的三次校准的校正方法, 正在以不同的校正方法, 在不使用树形的模型中, 将自己的校正的校正方法 。