Sepsis is a life-threatening organ malfunction caused by the host's inability to fight infection, which can lead to death without proper and immediate treatment. Therefore, early diagnosis and medical treatment of sepsis in critically ill populations at high risk for sepsis and sepsis-associated mortality are vital to providing the patient with rapid therapy. Studies show that advancing sepsis detection by 6 hours leads to earlier administration of antibiotics, which is associated with improved mortality. However, clinical scores like Sequential Organ Failure Assessment (SOFA) are not applicable for early prediction, while machine learning algorithms can help capture the progressing pattern for early prediction. Therefore, we aim to develop a machine learning algorithm that predicts sepsis onset 6 hours before it is suspected clinically. Although some machine learning algorithms have been applied to sepsis prediction, many of them did not consider the fact that six hours is not a small gap. To overcome this big gap challenge, we explore a multi-subset approach in which the likelihood of sepsis occurring earlier than 6 hours is output from a previous subset and feed to the target subset as additional features. Moreover, we use the hourly sampled data like vital signs in an observation window to derive a temporal change trend to further assist, which however is often ignored by previous studies. Our empirical study shows that both the multi-subset approach to alleviating the 6-hour gap and the added temporal trend features can help improve the performance of sepsis-related early prediction.
翻译:脓毒症是由感染引起的生命垂危的器官衰竭,它可能导致患者在没有适当和及时治疗的情况下死亡。因此,在高危脓毒症和脓毒症相关死亡率的危重患者中早期诊断和医疗治疗是向患者提供快速治疗的关键。研究表明,提前6小时诊断脓毒症可以更早地给予抗生素治疗,这与改善死亡率有关。然而,类似于顺序器官衰竭评分(SOFA)的临床评分不适用于早期预测,而机器学习算法可以帮助捕捉早期预测的进展模式。因此,我们的目标是开发一种机器学习算法,可以在临床怀疑之前6小时预测脓毒症发作。虽然一些机器学习算法已经应用于脓毒症预测,但其中许多都没有考虑6小时不是一个小的时间差这一事实。为了解决这个巨大的时间间隔问题,我们探索了一种多子集方法,其中早于6小时脓毒症发生的可能性是来自前一个子集的输出,并作为额外特征输入到目标子集中。此外,我们使用了以小时为单位采样的数据,例如观测窗口中的生命体征,以导出时间变化趋势以进一步协助,然而,这常常被先前的研究忽略。我们的实证研究显示,减轻6小时间隔的多子集方法和增加的时间趋势特征都可以帮助改善与脓毒症相关的早期预测的性能。