Despite their remarkable performance, deep neural networks remain unadopted in clinical practice, which is considered to be partially due to their lack in explainability. In this work, we apply attribution methods to a pre-trained deep neural network (DNN) for 12-lead electrocardiography classification to open this "black box" and understand the relationship between model prediction and learned features. We classify data from a public data set and the attribution methods assign a "relevance score" to each sample of the classified signals. This allows analyzing what the network learned during training, for which we propose quantitative methods: average relevance scores over a) classes, b) leads, and c) average beats. The analyses of relevance scores for atrial fibrillation (AF) and left bundle branch block (LBBB) compared to healthy controls show that their mean values a) increase with higher classification probability and correspond to false classifications when around zero, and b) correspond to clinical recommendations regarding which lead to consider. Furthermore, c) visible P-waves and concordant T-waves result in clearly negative relevance scores in AF and LBBB classification, respectively. In summary, our analysis suggests that the DNN learned features similar to cardiology textbook knowledge.
翻译:尽管临床实践表现出色,但深神经网络仍未在临床实践中被采纳,这在某种程度上被认为是由于缺乏解释。在这项工作中,我们将归属方法应用到一个受过训练的深神经网络(DNN),用于12级铅电心术分类,以打开这个“黑盒”并理解模型预测和学习特征之间的关系。我们从公共数据集和分配方法中分类数据,为每个分类信号样本分配了“相关性评分”,从而可以分析在培训期间学到的网络,为此我们建议了定量方法:a类、b类、铅和c类的平均相关评分。与健康控制相比,我们应用到一个受过训练的深神经网络(DNNN),用于12级铅电心术分类和左捆绑的分块(LBBBB)的相关评分分析表明,其平均值a)随着较高的分类概率增加,并在零左右与错误的分类相对应,b)符合临床建议。此外,明显的P波和一致的T波导致在AF和LBBB分类中明显具有负相关性的评分。我们的分析表明,DNN所学的特征显示,DN所学的特征与DMSBI的相类似。