While Deep Learning (DL) is often considered the state-of-the art for Artificial Intelligence-based medical decision support, it remains sparsely implemented in clinical practice and poorly trusted by clinicians due to insufficient interpretability of neural network models. We have tackled this issue by developing interpretable DL models in the context of online detection of epileptic seizure, based on EEG signal. This has conditioned the preparation of the input signals, the network architecture, and the post-processing of the output in line with the domain knowledge. Specifically, we focused the discussion on three main aspects: 1) how to aggregate the classification results on signal segments provided by the DL model into a larger time scale, at the seizure-level; 2) what are the relevant frequency patterns learned in the first convolutional layer of different models, and their relation with the delta, theta, alpha, beta and gamma frequency bands on which the visual interpretation of EEG is based; and 3) the identification of the signal waveforms with larger contribution towards the ictal class, according to the activation differences highlighted using the DeepLIFT method. Results show that the kernel size in the first layer determines the interpretability of the extracted features and the sensitivity of the trained models, even though the final performance is very similar after post-processing. Also, we found that amplitude is the main feature leading to an ictal prediction, suggesting that a larger patient population would be required to learn more complex frequency patterns. Still, our methodology was successfully able to generalize patient inter-variability for the majority of the studied population with a classification F1-score of 0.873 and detecting 90% of the seizures.
翻译:虽然深学习(DL)通常被视为人工智能医疗决策支持的先进技术,但在临床实践中实施得很少,临床医生由于神经网络模型的可解释性不足而信任度差。我们已经在EEG信号的基础上开发了可解释的DL模型,以在线检测癫痫发病情况为基础,从而解决这一问题。这决定了输入信号的准备、网络结构以及根据域知识对输出进行后处理。具体地说,我们的讨论集中在三个主要方面:1)如何将DL模型提供的病人诊断分部分的分类结果汇总成更大的时间尺度,在缉获一级;2)不同模型第一个变异层所学的相关频率模式,以及它们与德尔塔、地、阿尔法、β和伽马频带的关系。这取决于输入信号的信号信号、网络结构以及根据域知识对输出的后处理。3)识别信号波变形,对病人分类的贡献更大,根据使用深LIFT方法突出的激活差异。结果显示,在经过培训的货币分析后,人口变异性是我们所了解的精度,最终的精细度是测量的精度。