While a randomized controlled trial (RCT) readily measures the average treatment effect (ATE), this measure may need to be generalized to the target population to account for a sampling bias in the RCT's population. Identifying this target population treatment effect needs covariates in both sets to capture all treatment effect modifiers that are shifted between the two sets. Standard estimators then use either weighting (IPSW), outcome modeling (G-formula), or combine the two in doubly robust approaches (AIPSW). However such covariates are often not available in both sets. Therefore, after completing existing proofs on the complete case consistency of those three estimators, we compute the expected bias induced by a missing covariate, assuming a Gaussian distribution and a semi-parametric linear model. This enables sensitivity analysis for each missing covariate pattern, giving the sign of the expected bias. We also show that there is no gain in imputing a partially-unobserved covariate. Finally we study the replacement of a missing covariate by a proxy. We illustrate all these results on simulations, as well as semi-synthetic benchmarks using data from the Tennessee Student/Teacher Achievement Ratio (STAR), and with a real-world example from critical care medicine.
翻译:虽然随机控制试验(RCT)可以很容易地测量平均治疗效果(ATE),但这一措施可能需要推广到目标人群,以说明RCT人口中抽样偏差。确定这个目标人群治疗效果需要在两套中都出现差异,以捕捉所有在两套之间转移的治疗效果修饰物。标准估计者然后使用加权(IPSW),结果模型(G-Formula),或将两种双倍稳健方法(AIPSW)结合起来。但这两种方法通常都没有这种共变法。因此,在完成关于这三个估计者完全一致的现有案例的证据之后,我们计算出一个缺失的共变数引起的预期偏差,假设高斯分布和半参数线性线性模型。这样可以对每一种缺失的共变差模式进行敏度分析,显示预期偏差的标志。我们还表明,将部分缺差的共变法方法(AIPSWE)纳入一个部分缺异变法方法(AIPSWWE)没有好处。最后,我们研究用一个代方取代缺失的同差法。我们用模拟/Sciencealgregree-theal数据来说明所有这些结果,并用Scial-Cregystemagnistal-Igregregregnistal/Igregregregal exgyalgnicalgal数据,我们用一个真实的模型/Calgyalgymalgymgymgygymgymalgyal数据。