Missing data is frequently encountered in practice. Propensity score estimation is a popular tool for handling such missingness. The propensity score is often developed using the model for the response probability, which can be subject to model misspecification. In this paper, we consider an alternative approach of estimating the inverse of the propensity scores using the density ratio function. By partitioning the sample into two groups based on the response status of the elements, we can apply the density ratio function estimation method and obtain the inverse propensity scores for nonresponse adjustment. Density ratio estimation can be obtained by applying the so-called maximum entropy method, which uses the Kullback-Leibler divergence measure under calibration constraints. By including the covariates for the outcome regression models only into the density ratio model, we can achieve efficient propensity score estimation. We further extend the proposed approach to the multivariate missing case. Some limited simulation studies are presented to compare with the existing methods.
翻译:在实际中经常遇到缺失的数据。 Probensity 评分估计是处理这种缺失的一个流行工具。 偏差评分通常使用响应概率模型来制定, 可能受模型错误区分。 在本文中, 我们考虑另一种方法, 利用密度比率函数来估计偏差分的反差。 通过根据元素的响应状态将样本分成两组, 我们可以应用密度比率估计函数法, 并获得不响应调整的反偏差分。 密度比估计可以通过应用所谓的最大倍增率方法获得, 这种方法在校准限制下使用 Kullback- Leibelter 差异度测量法。 通过将结果回归模型的共变数只纳入密度比率模型, 我们可以实现高效的偏差估计。 我们进一步将拟议方法扩展至多变量缺失案例。 一些有限的模拟研究可以与现有方法进行比较。