Surrogate models (including deep neural networks and other machine learning algorithms in supervised learning) are capable of approximating arbitrarily complex, high-dimensional input-output problems in science and engineering, but require a thorough data-agnostic uncertainty quantification analysis before these can be deployed for any safety-critical application. The standard approach for data-agnostic uncertainty quantification is to use conformal prediction (CP), a well-established framework to build uncertainty models with proven statistical guarantees that do not assume any shape for the error distribution of the surrogate model. However, since the classic statistical guarantee offered by CP is given in terms of bounds for the marginal coverage, for small calibration set sizes (which are frequent in realistic surrogate modelling that aims to quantify error at different regions), the potentially strong dispersion of the coverage distribution around its average negatively impacts the reliability of the uncertainty model, often obtaining coverages below the expected value, resulting in a less applicable framework. After providing a gentle presentation of uncertainty quantification for surrogate models for machine learning practitioners, in this paper we bridge the gap by proposing a new statistical guarantee that offers probabilistic information for the coverage of a single conformal predictor. We show that the proposed framework converges to the standard solution offered by CP for large calibration set sizes and, unlike the classic guarantee, still offers reliable information about the coverage of a conformal predictor for small data sizes. We illustrate and validate the methodology in a suite of examples, and implement an open access software solution that can be used alongside common conformal prediction libraries to obtain uncertainty models that fulfil the new guarantee.
翻译:代理模型(包括监督学习中的深度神经网络及其他机器学习算法)能够近似科学与工程中任意复杂的高维输入-输出问题,但在部署至任何安全关键应用前,需进行彻底的数据无关不确定性量化分析。数据无关不确定性量化的标准方法是采用保形预测(CP)——一个成熟的框架,可构建具有严格统计保证的不确定性模型,且无需假设代理模型误差分布的任何形态。然而,由于CP提供的经典统计保证是基于边际覆盖率的边界给出的,对于较小的校准集规模(这在旨在量化不同区域误差的现实代理建模中十分常见),覆盖率分布可能围绕其均值出现强烈离散,从而影响不确定性模型的可靠性,常导致实际覆盖率低于预期值,降低了该框架的适用性。本文首先为机器学习从业者简要介绍代理模型的不确定性量化,随后通过提出一种新的统计保证来弥合现有差距,该保证可为单个保形预测器的覆盖率提供概率信息。我们证明,所提框架在大规模校准集下收敛于CP提供的标准解,且与经典保证不同,其仍能为小数据规模下保形预测器的覆盖率提供可靠信息。我们通过一系列案例演示并验证了该方法,并实现了一个开源软件解决方案,可与常见保形预测库结合使用,以获得满足新统计保证的不确定性模型。