Estimating treatment effects conditional on observed covariates can improve the ability to tailor treatments to particular individuals. Doing so effectively requires dealing with potential confounding, and also enough data to adequately estimate effect moderation. A recent influx of work has looked into estimating treatment effect heterogeneity using data from multiple randomized controlled trials and/or observational datasets. With many new methods available for assessing treatment effect heterogeneity using multiple studies, it is important to understand which methods are best used in which setting, how the methods compare to one another, and what needs to be done to continue progress in this field. This paper reviews these methods broken down by data setting: aggregate-level data, federated learning, and individual participant-level data. We define the conditional average treatment effect and discuss differences between parametric and nonparametric estimators, and we list key assumptions, both those that are required within a single study and those that are necessary for data combination. After describing existing approaches, we compare and contrast them and reveal open areas for future research. This review demonstrates that there are many possible approaches for estimating treatment effect heterogeneity through the combination of datasets, but that there is substantial work to be done to compare these methods through case studies and simulations, extend them to different settings, and refine them to account for various challenges present in real data.
翻译:以观测到的共变体为条件估计治疗效果,可以提高使治疗适合特定个人的能力。要有效做到这样有效,就需要处理潜在的混乱,并有足够的数据来适当估计效果的温和程度。最近大量的工作利用多类随机控制的试验和/或观察数据集的数据,对治疗效果的异质性进行了估计。由于有许多新的方法可用于利用多种研究来评估治疗效果的异质性,因此必须了解哪些方法最适于用来确定哪些方法,方法如何相互比较,以及需要做些什么才能在这一领域继续取得进展。本文审查了这些方法,这些方法被数据设置(综合层次数据、联合学习和个别参与者一级数据)所打破。我们界定了有条件的平均治疗效果,并讨论了参数性和非参数性估算器之间的差异。我们列出了关键假设,既包括单项研究所要求的方法,也包括数据组合所必需的方法。在描述现有方法之后,我们比较和比较这些方法,并揭示了今后研究的开放领域。这一审查表明,有许多可能的方法可以用来估计治疗效果的异质性,但通过各种模拟方法来比较目前的数据组合,通过各种模拟方法,从而将数据推算为改进现有方法。</s>