Traditional machine learning methods face two main challenges in dealing with healthcare predictive analytics tasks. First, the high-dimensional nature of healthcare data needs labor-intensive and time-consuming processes to select an appropriate set of features for each new task. Secondly, these methods depend on feature engineering to capture the sequential nature of patient data, which may not adequately leverage the temporal patterns of the medical events and their dependencies. Recent deep learning methods have shown promising performance for various healthcare prediction tasks by addressing the high-dimensional and temporal challenges of medical data. These methods can learn useful representations of key factors (e.g., medical concepts or patients) and their interactions from high-dimensional raw (or minimally-processed) healthcare data. In this paper we systemically reviewed studies focused on using deep learning as the prediction model to leverage patient time series data for a healthcare prediction task from methodological perspective. To identify relevant studies, MEDLINE, IEEE, Scopus and ACM digital library were searched for studies published up to February 7th 2021. We found that researchers have contributed to deep time series prediction literature in ten research streams: deep learning models, missing value handling, irregularity handling, patient representation, static data inclusion, attention mechanisms, interpretation, incorporating medical ontologies, learning strategies, and scalability. This study summarizes research insights from these literature streams, identifies several critical research gaps, and suggests future research opportunities for deep learning in patient time series data.
翻译:传统机器学习方法在处理保健预测分析任务方面面临两大挑战:第一,保健数据的高层面性质需要劳动密集和耗时的过程,以便为每一项新任务选择一套适当的特征;第二,这些方法取决于特征工程,以捕捉病人数据的顺序性质,这可能无法充分利用医疗事件及其依赖性的时间模式;最近深层次的学习方法表明,通过应对医疗数据的高层次和时间挑战,各种保健预测任务表现良好。这些方法可以了解关键因素(如医疗概念或病人)及其从高层次原始(或处理最少的)保健数据中互动的有用表现。在本文件中,我们系统审查了侧重于利用深度学习作为预测模型的研究,以利用病人的时间序列数据,从方法角度出发进行保健预测任务。为了确定相关研究、MEDLINE、IEEE、Scopus和ACM数字图书馆在2021年2月7日之前公布的研究报告。我们发现,研究人员在10个研究流中为深刻的时间序列的预测文献作出了贡献:深层次的学习模型、缺失的价值观处理、不规则的研究、病人的学习研究、这些静态研究、研究、研究中的一些研究机会。