Complete randomization allows for consistent estimation of the average treatment effect based on the difference in means of the outcomes without strong modeling assumptions on the outcome-generating process. Appropriate use of the pretreatment covariates can further improve the estimation efficiency. However, missingness in covariates is common in experiments and raises an important question: should we adjust for covariates subject to missingness, and if so, how? The unadjusted difference in means is always unbiased. The complete-covariate analysis adjusts for all completely observed covariates and improves the efficiency of the difference in means if at least one completely observed covariate is predictive of the outcome. Then what is the additional gain of adjusting for covariates subject to missingness? A key insight is that the missingness indicators act as fully observed pretreatment covariates as long as missingness is not affected by the treatment, and can thus be used in covariate adjustment to bring additional estimation efficiency. This motivates adding the missingness indicators to the regression adjustment, yielding the missingness-indicator method as a well-known but not so popular strategy in the literature of missing data. We recommend it due to its many advantages. We also propose modifications to the missingness-indicator method based on asymptotic and finite-sample considerations. To reconcile the conflicting recommendations in the missing data literature, we analyze and compare various strategies for analyzing randomized experiments with missing covariates under the design-based framework. This framework treats randomization as the basis for inference and does not impose any modeling assumptions on the outcome-generating process and missing-data mechanism.
翻译:完全随机随机化使得能够根据结果手段的差异对平均处理效果进行一致估计,而没有在成果产生过程上进行强有力的模型假设。 适当使用预处理共变法可以进一步提高估计效率。 但是,共变法中的缺漏在实验中很常见,并提出了一个重要问题:我们是否应该调整因缺漏而出现的共变法,如果是的话,如何? 未经调整的手段差异总是没有偏见的。 完整的差变分析对所有完全观察到的完全观察到的共变法进行调整,提高手段差异的效率,如果至少一个完全观测到的共变法是预测结果的。 适当使用预处理共变法可以进一步提高估算效率。 适当使用预变法的差在实验中是常见的,只要缺缺漏不受到处理的影响,那么,就会产生一个重要问题。 完整的差变差分析法分析方法使得缺漏率调整成为一种广为人知的但并不流行的计算结果。 我们建议对缺漏数据的文献中的缺漏数据进行更多的分析,我们建议对误化方法进行更精确的分析。