Conformal prediction is a widely used method to quantify uncertainty in settings where the data is independent and identically distributed (IID), or more generally, exchangeable. Conformal prediction takes in a pre-trained classifier, a calibration dataset and a confidence level as inputs, and returns a function which maps feature vectors to subsets of classes. The output of the returned function for a new feature vector (i.e., a test data point) is guaranteed to contain the true class with the pre-specified confidence. Despite its success and usefulness in IID settings, extending conformal prediction to non-exchangeable (e.g., Markovian) data in a manner that provably preserves all desirable theoretical properties has largely remained an open problem. As a solution, we extend conformal prediction to the setting of a Hidden Markov Model (HMM) with unknown parameters. The key idea behind the proposed method is to partition the non-exchangeable Markovian data from the HMM into exchangeable blocks by exploiting the de Finetti's Theorem for Markov Chains discovered by Diaconis and Freedman (1980). The permutations of the exchangeable blocks are then viewed as randomizations of the observed Markovian data from the HMM. The proposed method provably retains all desirable theoretical guarantees offered by the classical conformal prediction framework.
翻译:共变预测是一种广泛使用的方法,用于量化数据独立和同样分布(IID)或更普遍地可以互换的环境下的不确定性; 共变预测以预先训练的分类器、校准数据集和信任度作为输入,并返回一个功能,用于绘制矢量图到分类子组。 新的特性矢量(即测试数据点)的返回函数输出保证包含有预先指定的信任度的真实类别。 尽管它在ID的设置中取得了成功和有用,但将符合的预测扩大到非交换性(例如Markovian)数据,从而以可实现的保存所有理想理论属性的方式,这在很大程度上仍然是一个尚未解决的问题。 作为一种解决办法,我们将符合的预测扩展到具有未知参数的隐藏马尔科夫模型(HMM)的设置。 拟议方法的关键理念是利用Diaconis和Freedman(1980)所发现的马可交换性链条(例如Markov contraces)的理论性预测值,将非交换性数据从HMMM数据中分离成可交换的状态框架扩展为可观察的可观察的模型。