The limited scope of Randomized Controlled Trials (RCT) is increasingly under scrutiny, in particular when samples are unrepresentative. Indeed, some RCTs over- or under- sample individuals with certain characteristics compared to the target population, for which one want to draw conclusions on treatment effectiveness. Re-weighting trial individuals to match the target population helps to improve the treatment effect estimation. Such procedures require an estimation of the ratio of the two densities (trial and target distributions). In this work, we establish the exact expressions of the bias and variance of such reweighting procedures -- also called Inverse Propensity of Sampling Weighting (IPSW) -- in presence of categorical covariates for any sample size. Such results allow us to compare the theoretical performance of different versions of IPSW estimates. Besides, our results show how the performance (bias, variance and quadratic risk) of IPSW estimates depends on the two sample sizes (RCT and target population). A by-product of our work is the proof of consistency of IPSW estimates. A refined analysis also shows that IPSW performances are improved when the trial probability to be treated is estimated (rather than using its oracle counterpart). In addition, we study how including covariates that are unnecessary to a proper estimation of the weights may impact the asymptotic variance. We illustrate all the takeaways twice: in a toy and didactic example, and on a semi-synthetic simulation inspired from critical care medicine.
翻译:随机控制试验(RCT)的有限范围正在日益受到审查,特别是当样品不具有代表性时。事实上,有些RCT的过重或过低抽样人与目标人群相比具有某些特征,与目标人群相比,有些RCT具有某些特征,因此人们想就治疗效果得出结论。为了与目标人群相比,重新加权试验人有助于改进治疗效果估计。这种程序要求估计两种密度(审判和目标分布)的比例。在这项工作中,我们确定这种重新加权程序的偏差和差异的确切表现 -- -- 也称为抽样加权(IPSW)的反比重(IPSW) -- -- 存在任何抽样规模的绝对变量。这些结果使我们能够比较IPSW不同版本估计数的理论性能。此外,我们的结果显示IPSW估计数的性能(偏差、差异和二次风险)取决于两种样本规模(RCT和目标分布)。我们工作的附带结果是证明IPSW估计数的一致性。 更精确的分析还表明,当我们用实验的概率来评估时,包括精确性能,我们用正确的模型来解释。