30-day hospital readmission is a long standing medical problem that affects patients' morbidity and mortality and costs billions of dollars annually. Recently, machine learning models have been created to predict risk of inpatient readmission for patients with specific diseases, however no model exists to predict this risk across all patients. We developed a bi-directional Long Short Term Memory (LSTM) Network that is able to use readily available insurance data (inpatient visits, outpatient visits, and drug prescriptions) to predict 30 day re-admission for any admitted patient, regardless of reason. The top-performing model achieved an ROC AUC of 0.763 (0.011) when using historical, inpatient, and post-discharge data. The LSTM model significantly outperformed a baseline random forest classifier, indicating that understanding the sequence of events is important for model prediction. Incorporation of 30-days of historical data also significantly improved model performance compared to inpatient data alone, indicating that a patients clinical history prior to admission, including outpatient visits and pharmacy data is a strong contributor to readmission. Our results demonstrate that a machine learning model is able to predict risk of inpatient readmission with reasonable accuracy for all patients using structured insurance billing data. Because billing data or equivalent surrogates can be extracted from sites, such a model could be deployed to identify patients at risk for readmission before they are discharged, or to assign more robust follow up (closer follow up, home health, mailed medications) to at-risk patients after discharge.
翻译:30天的医院重新接纳是一个长期存在的医学问题,影响到病人每年的发病率和死亡率以及费用达数十亿美元。最近,建立了机器学习模型,以预测特定疾病病人住院重新接纳的风险,然而,没有模型可以预测所有病人的这种风险。我们开发了一个双向长期短期内存(LSTM)网络,这个网络可以使用现成的保险数据(住院访问、门诊访问和药物处方)预测任何住院病人30天的重新接纳,而不管原因如何。最优秀的模型在使用历史、住院和放行后的数据时,实现了0.763(0.011)的ROC ASUC。LSTM模型大大超过一个基线随机森林分类器,表明了解事件的顺序对于模型预测很重要。纳入30天的历史数据还大大改进了模型性能,与仅住院数据相比,表明入院前的病人临床历史,包括门诊访问和药房数据,是重新接纳的有力贡献者。我们的研究结果表明,机器学习模型能够预测住院者在离家重新接收邮件的风险,在使用所有病人结构保险数据之前,在进行这样的数据库中,可以确定一个等值。