Objectives: To evaluate the consequences of the framing of machine learning risk prediction models. We evaluate how framing affects model performance and model learning in four different approaches previously applied in published artificial-intelligence (AI) models. Setting and participants: We analysed structured secondary healthcare data from 221,283 citizens from four Danish municipalities who were 18 years of age or older. Results: The four models had similar population level performance (a mean area under the receiver operating characteristic curve of 0.73 to 0.82), in contrast to the mean average precision, which varied greatly from 0.007 to 0.385. Correspondingly, the percentage of missing values also varied between framing approaches. The on-clinical-demand framing, which involved samples for each time the clinicians made an early warning score assessment, showed the lowest percentage of missing values among the vital sign parameters, and this model was also able to learn more temporal dependencies than the others. The Shapley additive explanations demonstrated opposing interpretations of SpO2 in the prediction of sepsis as a consequence of differentially framed models. Conclusions: The profound consequences of framing mandate attention from clinicians and AI developers, as the understanding and reporting of framing are pivotal to the successful development and clinical implementation of future AI technology. Model framing must reflect the expected clinical environment. The importance of proper problem framing is by no means exclusive to sepsis prediction and applies to most clinical risk prediction models.
翻译:目标:评估设计机器学习风险预测模型的影响; 我们评估设计对模型业绩和模型学习的影响,以前在公布的人工智能模型中应用了四种不同方法; 设置和参与者: 我们分析了来自四个丹麦城市、18岁或18岁以上的221 283名18岁或18岁以上的221 283名公民的结构性二级保健数据; 结果:这四个模型具有类似的人口水平业绩(接收器操作特征曲线0.73至0.82下的一个平均区域),而平均精确度则大相径庭,从0.007到0.385不等; 相应的是,缺失值的百分比在设计方法之间也各不相同。 临床需求框架涉及临床医生和AI开发商每次进行预警分数评估的样本,显示关键信号参数中缺失值的百分比最低,这一模型也能够了解比其他模型更多的时间依赖性。 与平均平均精确度相比,平均精确度从0.007到0.385不等。 相应的结论: 临床模型设计对任务关注的深刻后果,因为临床学家和AI开发商家,这是临床预测中最能理解和报告的临床预测,因此,制定正确的临床预测必须反映正确的临床预测的正确发展方式。