Conformal prediction is a widely used method to quantify uncertainty in settings where the data is independent and identically distributed (IID), or more generally, exchangeable. Conformal prediction takes in a pre-trained classifier, a calibration dataset and a confidence level as inputs, and returns a function which maps feature vectors to subsets of classes. The output of the returned function for a new feature vector (i.e., a test data point) is guaranteed to contain the true class with the pre-specified confidence. Despite its success and usefulness in IID settings, extending conformal prediction to non-exchangeable (e.g., Markovian) data in a manner that provably preserves all desirable theoretical properties has largely remained an open problem. As a solution, we extend conformal prediction to the setting of a Hidden Markov Model (HMM) with unknown parameters. The key idea behind the proposed method is to partition the non-exchangeable Markovian data from the HMM into exchangeable blocks by exploiting the de Finetti's Theorem for Markov Chains discovered by Diaconis and Freedman (1980). The permutations of the exchangeable blocks are then viewed as randomizations of the observed Markovian data from the HMM. The proposed method provably retains all desirable theoretical guarantees offered by the classical conformal prediction framework and is general enough to be useful in many sequential prediction problems.
翻译:共变预测是一种广泛使用的方法,用于量化数据独立和同样分布(IID)或更普遍地可以互换的环境下的不确定性; 共变预测以预先训练的分类器、校准数据集和信任度作为输入,并返回一个功能,用于将矢量映射到类别子集。 将新特性矢量(即测试数据点)的返回函数输出保证包含有预先指定的信任度的真实类别。 尽管它在 IID 设置中取得了成功和有用,但将符合的预测扩大到非交换性数据(如Markovian),从而以可实现的保存所有理想理论属性的方式,这在很大程度上仍然是一个尚未解决的问题。 作为一种解决办法,我们将符合的预测扩展至具有未知参数的隐藏马尔科夫模型(HMM)的设置。 拟议方法的关键理念是利用足够多的 De Finetti 设置的马可交换性数据,从而将当时 Diaconis和Freedman(1980)所发现的马可交换性链(Markov ) 的理论性数据扩展框架扩展到当时所观察到的逻辑性周期性预测性数据,这是从所观测到的逻辑上的逻辑性标准性数据的预结果。