多种方法用于整合试验和非实验数据以研究治疗效果的异质性 (Methods for Integrating Trials and Non-Experimental Data to Examine Treatment Effect Heterogeneity)

Estimating treatment effects conditional on observed covariates can improve the ability to tailor treatments to particular individuals. Doing so effectively requires dealing with potential confounding, and also enough data to adequately estimate effect moderation. A recent influx of work has looked into estimating treatment effect heterogeneity using data from multiple randomized controlled trials and/or observational datasets. With many new methods available for assessing treatment effect heterogeneity using multiple studies, it is important to understand which methods are best used in which setting, how the methods compare to one another, and what needs to be done to continue progress in this field. This paper reviews these methods broken down by data setting: aggregate-level data, federated learning, and individual participant-level data. We define the conditional average treatment effect and discuss differences between parametric and nonparametric estimators, and we list key assumptions, both those that are required within a single study and those that are necessary for data combination. After describing existing approaches, we compare and contrast them and reveal open areas for future research. This review demonstrates that there are many possible approaches for estimating treatment effect heterogeneity through the combination of datasets, but that there is substantial work to be done to compare these methods through case studies and simulations, extend them to different settings, and refine them to account for various challenges present in real data.

翻译：通过观察协变量估计治疗效应，可以提高将治疗个体化的能力。这需要处理潜在的混淆因素，并且需要充足的数据以充分估计效果异质性。最近，已经有很多新方法利用多个随机对照试验和/或观察数据集来估计治疗效果的异质性。有了这么多新方法可用于从多个研究中评估治疗效果的异质性，有必要了解哪些方法在哪种情况下最好使用，这些方法彼此之间的比较以及需要做什么来继续推进研究领域。本文通过数据设置对这些方法进行了归类：聚合水平数据，联邦学习和个体参与者级别数据。我们定义了条件平均治疗效应，并讨论了参数估计和非参数估计之间的差异，也列举了关键的假设，既包括单个研究所需的假设，也包括数据合并所需的假设。在描述现有方法之后，我们进行了比较和对比，并揭示了未来研究的开放领域。该综述表明，通过数据集的结合，有许多可能用于估计治疗效果异质性的方法，但要比较这些方法需要通过案例研究和模拟，将其扩展到不同的环境，并改进使其适应真实数据中存在的各种挑战。