通过De Finetti的Markov 链条理论扩展对具有精确有效性的隐藏 Markov 模型的共形预测 (Extending Conformal Prediction to Hidden Markov Models with Exact Validity via de Finetti's Theorem for Markov Chains)

Conformal prediction is a widely used method to quantify uncertainty in settings where the data is independent and identically distributed (IID), or more generally, exchangeable. Conformal prediction takes in a pre-trained classifier, a calibration dataset and a confidence level as inputs, and returns a function which maps feature vectors to subsets of classes. The output of the returned function for a new feature vector (i.e., a test data point) is guaranteed to contain the true class with the pre-specified confidence. Despite its success and usefulness in IID settings, extending conformal prediction to non-exchangeable (e.g., Markovian) data in a manner that provably preserves all desirable theoretical properties has largely remained an open problem. As a solution, we extend conformal prediction to the setting of a Hidden Markov Model (HMM) with unknown parameters. The key idea behind the proposed method is to partition the non-exchangeable Markovian data from the HMM into exchangeable blocks by exploiting the de Finetti's Theorem for Markov Chains discovered by Diaconis and Freedman (1980). The permutations of the exchangeable blocks are then viewed as randomizations of the observed Markovian data from the HMM. The proposed method provably retains all desirable theoretical guarantees offered by the classical conformal prediction framework. Detailed numerical results that verify and compliment the theoretical conclusions are provided to illustrate the performance of the proposed method.

翻译：共变预测是一种广泛使用的方法,用于量化数据独立和同样分布(IID)或更普遍地可以互换的环境下的不确定性; 共变预测以预先训练的分类器、校准数据集和信任度作为投入,并返回一个功能,用于将矢量映射到类别子集。新特性矢量(即测试数据点) 的返回函数输出保证包含有预先指定的信任度的真实类别。尽管它在ID的设置中取得了成功和有用,但将符合的预测扩大到非交换性(例如Markovian)数据,从而以可可被确认的方式保存所有可取的理论属性,这在很大程度上仍然是一个尚未解决的问题。作为一种解决办法,我们将符合的预测扩展到含有未知参数的隐藏的Markov模型(HMM)的设置。拟议方法的关键理念是通过利用Diaconis和Freedman(1980)所发现的马可互换数据链(例如Markov contracils)的准确性预测值框架,将非交换的Markovian 数据输出为可互换的准确性标准。所观察到的逻辑性结果的随机性结论是从所观察到的HMMM的逻辑方法提供的。