Mahalanobis distance between treatment group and control group covariate means is often adopted as a balance criterion when implementing a rerandomization strategy. However, this criterion may not work well for high-dimensional cases because it balances all orthogonalized covariates equally. Here, we propose leveraging principal component analysis (PCA) to identify proper subspaces in which Mahalanobis distance should be calculated. Not only can PCA effectively reduce the dimensionality for high-dimensional cases while capturing most of the information in the covariates, but it also provides computational simplicity by focusing on the top orthogonal components. We show that our PCA rerandomization scheme has desirable theoretical properties on balancing covariates and thereby on improving the estimation of average treatment effects. We also show that this conclusion is supported by numerical studies using both simulated and real examples.
翻译:Mahalanobis 处理组与控制组共变方法之间的距离,在执行重整战略时,往往被当作平衡标准。然而,这一标准在高维情况中可能不起作用,因为它平衡了所有正对等共变。在这里,我们提议利用主元组成部分分析(PCA)来确定应当计算马哈拉诺比距离的适当次空间。不仅五氯苯甲醚能够有效地减少高维病例的维度,同时捕捉共变中的大部分信息,而且还通过侧重于顶部正方形组件提供计算简单性。我们表明,我们的五氯苯甲醚重整方案在平衡共变换并从而改进平均治疗效果的估算方面具有可取的理论属性。我们还表明,使用模拟和真实实例进行的数字研究支持这一结论。