We aim to generalize the results of a randomized controlled trial (RCT) to a target population with the help of some observational data. This is a problem of causal effect identification with multiple data sources. Challenges arise when the RCT is conducted in a context that differs from the target population. Earlier research has focused on cases where the estimates from the RCT can be adjusted by observational data in order to remove the selection bias and other domain specific differences. We consider examples where the experimental findings cannot be generalized by an adjustment and show that the generalization may still be possible by other identification strategies that can be derived by applying do-calculus. The obtained identifying functionals for these examples contain trapdoor variables of a new type. The value of a trapdoor variable needs to be fixed in the estimation and the choice of the value may have a major effect on the bias and accuracy of estimates, which is also seen in simulations. The presented results expand the scope of settings where the generalization of experimental findings is doable
翻译:我们的目标是借助某些观测数据,将随机控制试验的结果推广到目标人群,这是一个与多个数据源的因果关系识别问题。当在与目标人群不同的背景下进行RCT时,将出现挑战。早期研究侧重于通过观察数据对RCT的估计数进行调整以便消除选择偏差和其他特定领域差异的案例。我们考虑了实验结果无法通过调整而普遍化的例子,并表明通过应用计算方法可以得出的其他识别战略仍可能实现这种普遍化。这些例子的识别功能包含一种新类型的陷阱门变量。在估计中需要固定一个陷阱门变量的价值,而且该数值的选择可能对估计的偏差和准确性产生重大影响,这一点在模拟中也可以看到。提出的结果扩大了实验结果可实现普遍性的环境范围。