When are inferences (whether Direct-Likelihood, Bayesian, or Frequentist) obtained from partial data valid? This paper answers this question by offering a new theory about inference with missing data. It proves that as the sample size increases and the extent of missingness decreases, the mean-loglikelihood function generated by partial data and that ignores the missingness mechanism will almost surely converge uniformly to that which would have been generated by complete data; and if the data are Missing at Random, this convergence depends only on sample size. Thus, inferences on partial data, such as posterior modes, uncertainty estimates, confidence intervals, likelihood ratios, and indeed, all quantities or features derived from the partial-data loglikelihood function, will approximate their true values (what they would have been given complete data). This adds to previous research which has only proved the consistency of the posterior mode. Practical implications of this result are discussed, and the theory is tested on a previous study of International Human Rights Law.
翻译:当从部分数据中获得推论(直接获益、贝叶斯或常客)时,何时从部分数据中获得推论(直接获益、贝叶斯或常客)是有效的?本文件回答这一问题时,提供了对缺失数据推断的新理论。它证明随着抽样规模的扩大和缺失程度的缩小,部分数据产生的中位相似功能将几乎肯定会与完整数据产生的推论一致;如果数据在随机时缺失,这种趋同仅取决于抽样大小。因此,对部分数据(例如后方模式、不确定性估计、信任期、概率比率,以及实际上从部分数据日志函数中得出的所有数量或特征)的推论将接近其真实值(它们本来会得到哪些完整数据 ) 。这补充了以前的研究,这些研究只证明后方模式的一致性。讨论了这一结果的实际影响,并在以前对国际人权法的研究中测试了理论。