Heart failure (HF) is a major cause of mortality. Accurately monitoring HF progress and adjust therapies are critical for improving patient outcomes. An experienced cardiologist can make accurate HF stage diagnoses based on combination of symptoms, signs, and lab results from the electronic health records (EHR) of a patient, without directly measuring heart function. We examined whether machine learning models, more specifically the XGBoost model, can accurately predict patient stage based on EHR, and we further applied the SHapley Additive exPlanations (SHAP) framework to identify informative features and their interpretations. Our results indicate that based on structured data from EHR, our models could predict patients' ejection fraction (EF) scores with moderate accuracy. SHAP analyses identified informative features and revealed potential clinical subtypes of HF. Our findings provide insights on how to design computing systems to accurately monitor disease progression of HF patients through continuously mining patients' EHR data.
翻译:心脏衰竭(HF)是导致死亡的一个主要原因。准确监测高频进展和调整疗法对于改善患者结果至关重要。有经验的心脏病学家可以根据症状、征兆和病人电子健康记录(EHR)的实验室结果综合进行准确的高频阶段诊断,而不直接测量心脏功能。我们研究了机器学习模型,更具体地说是XGBoost模型,是否能够根据EHR准确预测患者阶段,我们进一步运用了Shampley Additive Explectation(SHAP)框架来确定信息特征及其解释。我们的结果表明,根据EHR的结构性数据,我们的模型可以以中度精确度预测病人的排出分数。SHFP分析确定了信息特征,并揭示了潜在的临床次型HF。我们的研究发现如何设计计算机系统,通过持续挖掘病人的 EHR数据来准确监测高频病人的疾病发展。