Background: The methods with which prediction models are usually developed mean that neither the parameters nor the predictions should be interpreted causally. For many applications this is perfectly acceptable. However, when prediction models are used to support decision making, there is often a need for predicting outcomes under hypothetical interventions. Aims: We aimed to identify and compare published methods for developing and validating prediction models that enable risk estimation of outcomes under hypothetical interventions, utilizing causal inference. We aimed to identify the main methodological approaches, their underlying assumptions, targeted estimands, and possible sources of bias. Finally, we aimed to highlight unresolved methodological challenges. Methods: We systematically reviewed literature published by December 2019, considering papers in the health domain that used causal considerations to enable prediction models to be used to evaluate predictions under hypothetical interventions. We included both methodology development studies and applied studies. Results: We identified 4919 papers through database searches and a further 115 papers through manual searches. Of these, 87 papers were retained for full text screening, of which 12 were selected for inclusion. We found papers from both the statistical and the machine learning literature. Most of the identified methods for causal inference from observational data were based on marginal structural models and g-estimation.
翻译:目标:我们的目标是查明和比较已公布的用于开发和验证预测模型的方法,以便能够利用因果推断对假设干预措施的结果进行风险估计。我们的目标是查明主要方法、其基本假设、目标估计和可能的偏差来源。最后,我们的目标是突出尚未解决的方法挑战。方法:我们系统地审查了2019年12月前出版的文献,审议了卫生领域的文件,其中使用了因果考虑因素,使预测模型能够用来评价假设干预措施下的预测。我们既包括方法开发研究和应用研究。结果:我们通过数据库搜索确定了4919份文件,又通过人工搜索确定了115份文件。结果:我们通过数据库搜索确定了4919份文件,其中87份文件被保留用于全文筛选,其中12份文件被选中供列入。我们从统计和机器学习文献中找到了文件。我们从观察数据中找出的因果推断方法大多以边缘结构模型为基础。