When are inferences (whether Direct-Likelihood, Bayesian, or Frequentist) obtained from partial data valid? This paper answers this question by offering a new theory about inference with missing data. It proves that as the sample size increases and the extent of missingness decreases, the mean-loglikelihood function generated by partial data and that ignores the missingness mechanism will almost surely converge uniformly to that which would have been generated by complete data; and if the data are Missing at Random (or "partially missing at random"), this convergence depends only on sample size. Thus, inferences from partial data, such as posterior modes, uncertainty estimates, confidence intervals, likelihood ratios, and indeed, all quantities or features derived from the partial-data loglikelihood function, will be consistently estimated. They will approximate their complete-data analogues. This adds to previous research which has only proved the consistency of the posterior mode. Practical implications of this result are discussed, and the theory is verified using a previous study of International Human Rights Law.
翻译:从部分数据中获得的推论何时有效(直接利基、巴耶斯或常识)?本文回答这一问题时,提供了对缺失数据推断的新理论。它证明随着抽样规模的扩大和缺失程度的缩小,部分数据产生的中位相似功能将几乎肯定会与完整数据产生的推论一致;如果数据在随机(或“部分随机失踪”)时缺失,这种趋同仅取决于抽样大小。因此,从部分数据(如后端模式、不确定性估计、信任期、概率比、以及实际上从部分数据对齐功能中得出的所有数量或特征的推论,将会得到一致的估计。它们将大致地估计完整的数据模拟。这与以前的研究相加,而以前的研究仅证明了后方数据模式的一致性。讨论了这一结果的实际影响,并用以前对国际人权法的研究核实了理论。