Randomized Controlled Trials (RCTs) are often considered as the gold standard to conclude on the causal effect of a given intervention on an outcome, but they may lack of external validity when the population eligible to the RCT is substantially different from the target population. Having at hand a sample of the target population of interest allows to generalize the causal effect. Identifying this target population treatment effect needs covariates in both sets to capture all treatment effect modifiers that are shifted between the two sets. However such covariates are often not available in both sets. Standard estimators then use either weighting (IPSW), outcome modeling (G-formula), or combine the two in doubly robust approaches (AIPSW). In this paper, after completing existing proofs on the complete case consistency of those three estimators, we compute the expected bias induced by a missing covariate, assuming a Gaussian distribution and a semi-parametric linear model. This enables sensitivity analysis for each missing covariate pattern, giving the sign of the expected bias. We also show that there is no gain in imputing a partially-unobserved covariate. Finally we study the replacement of a missing covariate by a proxy. We illustrate all these results on simulations, as well as semi-synthetic benchmarks using data from the Tennessee Student/Teacher Achievement Ratio (STAR), and with a real-world example from critical care medicine.
翻译:控制控制试验(RCTs)通常被视为就某项干预对结果产生的因果影响得出结论的黄金标准,但当有资格获得RCT的人口与目标人口大不相同时,这种标准可能缺乏外部有效性。当有资格获得RCT的人口与目标人口大不相同时,对感兴趣的目标人口进行抽样抽样,可以概括因果关系效应。确定这个目标的人口处理效果,需要在两组中进行共变换,以捕捉在两组之间转移的所有治疗效果变换。但这两种组合通常都没有。标准估计者然后使用加权(IPSW)、结果模型(G-Formula),或者将两种双重强势方法结合起来(AIPSW),这些结果可能缺乏外部有效性。在本文件中,在完成关于这三个估计者完全一致的现有案例证据后,我们计算出由缺失的 Coverate 所引发的预期偏差,假设一个高音分布和半偏差线性线性模型。这样就可以对每一个缺失的调调模式进行感应进行感性分析,并显示预期的偏差。我们还表明,在将一个精确的医学比数模型中,即我们用这些误差数据作为共同的模型来替换。