Inductive Conformal Prediction (ICP) is a set of distribution-free and model agnostic algorithms devised to predict with a user-defined confidence with coverage guarantee. Instead of having point predictions, i.e., a real number in the case of regression or a single class in multi class classification, models calibrated using ICP output an interval or a set of classes, respectively. ICP takes special importance in high-risk settings where we want the true output to belong to the prediction set with high probability. As an example, a classification model might output that given a magnetic resonance image a patient has no latent diseases to report. However, this model output was based on the most likely class, the second most likely class might tell that the patient has a 15% chance of brain tumor or other severe disease and therefore further exams should be conducted. Using ICP is therefore way more informative and we believe that should be the standard way of producing forecasts. This paper is a hands-on introduction, this means that we will provide examples as we introduce the theory.
翻译:诱导性预测(ICP)是一套无分布式和模型性不可知算法,设计这些算法是为了以用户定义的自信和覆盖保证进行预测。它没有点预测,即回归或多类分类中的单类的真实数字,而是分别使用比较方案输出的间隔或一组类别校准的模型。比较方案在高风险环境中特别重要,因为我们希望真实产出属于高概率的预测组。举例来说,分类模型的输出可能给出一个磁共振图像,患者没有隐性疾病可报告。然而,这一模型输出基于最有可能的类别,第二个最有可能的类别可能显示病人有15%的脑肿瘤或其他严重疾病的可能性,因此应该进行进一步测试。因此,使用比较方案可以提供更丰富的信息,我们认为这应该是产生预报的标准方式。这份文件是一个亲手介绍,这意味着我们将在介绍理论时提供实例。