稳定算法的有条件预测推论 (Conditional predictive inference for stable algorithms)

We investigate generically applicable and intuitively appealing prediction intervals based on $k$-fold cross validation. We focus on the conditional coverage probability of the proposed intervals, given the observations in the training sample (hence, training conditional validity), and show that it is close to the nominal level, in an appropriate sense, provided that the underlying algorithm used for computing point predictions is sufficiently stable when feature-response pairs are omitted. Our results are based on a finite sample analysis of the empirical distribution function of $k$-fold cross validation residuals and hold in non-parametric settings with only minimal assumptions on the error distribution. To illustrate our results, we also apply them to high-dimensional linear predictors, where we obtain uniform asymptotic training conditional validity as both sample size and dimension tend to infinity at the same rate and consistent parameter estimation typically fails. These results show that despite the serious problems of resampling procedures for inference on the unknown parameters (cf. Bickel and Freedman, 1983; El Karoui and Purdom, 2018; Mammen, 1996), cross validation methods can be successfully applied to obtain reliable predictive inference even in high dimensions and conditionally on the training data.

翻译：我们根据以美元乘数校准法调查一般适用和直觉具有吸引力的预测间隔。我们侧重于拟议间隔的有条件覆盖概率,考虑到培训样本中的观察结果(因此,培训有条件有效性),我们注重拟议的间隔的有条件覆盖概率,并表明,如果计算点预测所使用的基本算法在省略地响应对配方时足够稳定,在适当意义上接近名义水平,只要计算点预测所使用的基本算法在特性响应对配方省略时足够稳定。我们的结果基于对美元乘数校准剩余物的经验分配功能的有限抽样分析,并保存在非参数中,只有最低的误差分布假设。为了说明我们的结果,我们还将它们应用到高维线性线性预测器,因为我们获得统一的无线性培训的有条件有效性,因为样本大小和尺寸都倾向于同一比例和一致参数估计通常都失败。这些结果显示,尽管对未知参数的推断程序存在重新采样的严重问题(参见Bickel和Freedman,1983年;El Karoui和Purdom,2018;Mem,1996年),可以成功地应用交叉验证方法,以获得可靠的预测性数据,即使是在高维度和有条件的培训。

相关内容

交叉验证

关注 2

交叉验证，有时也称为旋转估计或样本外测试，是用于评估统计结果如何的各种类似模型验证技术中的任何一种分析将概括为一个独立的数据集。它主要用于设置，其目的是预测，和一个想要估计如何准确地一个预测模型在实践中执行。在预测问题中，通常会给模型一个已知数据的数据集，在该数据集上进行训练（训练数据集）以及未知数据（或首次看到的数据）的数据集（根据该数据集测试模型）（称为验证数据集或测试集）。交叉验证的目标是测试模型预测未用于估计数据的新数据的能力，以发现诸如过度拟合或选择偏倚之类的问题，并提供有关如何进行建模的见解。该模型将推广到一个独立的数据集（例如，未知数据集，例如来自实际问题的数据集）。一轮交叉验证涉及分割一个样品的数据到互补的子集，在一个子集执行所述分析（称为训练集），以及验证在另一子集中的分析（称为验证集合或测试集）。为了减少可变性，在大多数方法中，使用不同的分区执行多轮交叉验证，并将验证结果组合（例如取平均值）在各轮中，以估计模型的预测性能。总而言之，交叉验证结合了预测中适用性的度量（平均），以得出模型预测性能的更准确估计。

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

76+阅读 · 2022年6月28日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

82+阅读 · 2020年7月26日

因果图，Causal Graphs，52页ppt

专知会员服务

253+阅读 · 2020年4月19日