Deep neural networks have shown promising results for various clinical prediction tasks. However, training deep networks such as those based on Recurrent Neural Networks (RNNs) requires large labeled data, significant hyper-parameter tuning effort and expertise, and high computational resources. In this work, we investigate as to what extent can transfer learning address these issues when using deep RNNs to model multivariate clinical time series. We consider two scenarios for transfer learning using RNNs: i) domain-adaptation, i.e., leveraging a deep RNN - namely, TimeNet - pre-trained for feature extraction on time series from diverse domains, and adapting it for feature extraction and subsequent target tasks in healthcare domain, ii) task-adaptation, i.e., pre-training a deep RNN - namely, HealthNet - on diverse tasks in healthcare domain, and adapting it to new target tasks in the same domain. We evaluate the above approaches on publicly available MIMIC-III benchmark dataset, and demonstrate that (a) computationally-efficient linear models trained using features extracted via pre-trained RNNs outperform or, in the worst case, perform as well as deep RNNs and statistical hand-crafted features based models trained specifically for target task; (b) models obtained by adapting pre-trained models for target tasks are significantly more robust to the size of labeled data compared to task-specific RNNs, while also being computationally efficient. We, therefore, conclude that pre-trained deep models like TimeNet and HealthNet allow leveraging the advantages of deep learning for clinical time series analysis tasks, while also minimize dependence on hand-crafted features, deal robustly with scarce labeled training data scenarios without overfitting, as well as reduce dependence on expertise and resources required to train deep networks from scratch.
翻译:深心神经网络为各种临床预测任务展示了有希望的成果。然而,培训深度网络,如基于常规神经网络的深度网络,需要大量标签数据、大量的超参数调整努力和专门知识以及高计算资源。在这项工作中,我们调查在使用深度RNN网络来模拟多变临床时间序列时,在多大程度上可以转移学习解决这些问题。我们考虑两种利用RNN网络进行传输学习的情景:i) 域适应,即利用深度的RNN(即TimeNet)――从不同领域对时间序列的特征提取进行预先培训,并调整网络在保健领域对地貌特征的提取和随后的目标任务,二)任务适应性,即,在保健领域对深度RNNN(即健康网络)进行不同任务的培训,并适应同一领域新的目标任务。我们评估了上述在公开提供的MIMIC-III基准数据集方面的做法,并表明,(a) 利用通过事先培训的 RNNW 的深度数据序列提取的特征提取和随后的具体目标分析,同时,通过经过更深入的统计任务,通过经过更精确的模型进行更精确的计算,同时进行更精确的计算,在最精确的统计任务中,在最精细的模型上进行。