Inductive Conformal Prediction (ICP) is a set of distribution-free and model agnostic algorithms devised to predict with a user-defined confidence with coverage guarantee. Instead of having \textit{point predictions}, i.e., a real number in the case of regression or a single class in multi class classification, models calibrated using ICP output an interval or a set of classes, respectively. ICP takes special importance in high-risk settings where we want the real output to belong to the prediction set with high probability. As an example, a classification model might output that given a magnetic resonance image a patient has no latent diseases to report. However, this model output was based on the most likely class, the second most likely class might tell that the patient has a 15\% chance of brain tumor or other severe disease and therefore further exams should be conducted. Using ICP is therefore way more informative and we believe that should be the standard way of producing forecasts. This paper is a hands-on introduction, this means that we will provide examples as we introduce the theory.
翻译:诱导性预测(ICP)是一套无分布式和模型性不可知算法,设计这些算法是为了以用户定义的自信和覆盖保证进行预测。它不是要提供\ textit{point 预测},而是要提供回归的真实数字或多级分类中的单类,即分别使用比较方案输出的间隔或一组分类校准的模型。比较方案在高风险环境中特别重要,因为我们希望真实产出属于高概率的预测组。举例来说,分类模型可能会输出出一个磁共振图像,使病人没有隐性疾病需要报告。然而,这一模型输出是根据最有可能的类别,第二大可能显示病人有15 ⁇ 脑肿瘤或其他严重疾病的可能性,因此应该进行进一步测试。因此,使用比较方案可以提供更丰富的信息,我们认为,这应该是产生预报的标准方式。本文是一个亲手介绍,这意味着我们将在介绍理论时提供实例。