Deep learning has demonstrated success in many applications; however, their use in healthcare has been limited due to the lack of transparency into how they generate predictions. Algorithms such as Recurrent Neural Networks (RNNs) when applied to Electronic Medical Records (EMR) introduce additional barriers to transparency because of the sequential processing of the RNN and the multi-modal nature of EMR data. This work seeks to improve transparency by: 1) introducing Learned Binary Masks (LBM) as a method for identifying which EMR variables contributed to an RNN model's risk of mortality (ROM) predictions for critically ill children; and 2) applying KernelSHAP for the same purpose. Given an individual patient, LBM and KernelSHAP both generate an attribution matrix that shows the contribution of each input feature to the RNN's sequence of predictions for that patient. Attribution matrices can be aggregated in many ways to facilitate different levels of analysis of the RNN model and its predictions. Presented are three methods of aggregations and analyses: 1) over volatile time periods within individual patient predictions, 2) over populations of ICU patients sharing specific diagnoses, and 3) across the general population of critically ill children.
翻译:在许多应用中,深层次的学习证明是成功的;然而,由于在如何作出预测方面缺乏透明度,在保健方面的使用有限;在应用电子医疗记录时,诸如经常性神经网络等常神经网络(NNN)增加了透明度障碍,因为对RNN的连续处理和EMR数据的多模式性质,这项工作力求提高透明度,其方法是:(1) 采用学习的双筒面具(LBM),作为确定哪些EMM变量有助于RN模型对严重疾病儿童的死亡率(ROM)预测;(2) 为同一目的应用KernelSHAP。鉴于个别病人、LBM和KernelSHAP都产生了一个归属矩阵,显示每个输入特征对RNN的病人预测序列的贡献。可以以多种方式汇总各种归称矩阵,以便利对RNN模型及其预测进行不同程度的分析。提出的汇总和分析方法有三种:(1) 在个别病人预测的期间,2) 共享特定诊断的ICU病人群体,3 以及整个儿童。