Objective. Deep neural networks (DNNs) have shown unprecedented success in various brain-machine interface applications such as epileptic seizure prediction. However, existing approaches typically train models in a patient-specific fashion due to the highly personalized characteristics of epileptic signals. Therefore, only a limited number of labeled recordings from each subject can be used for training. As a consequence, current DNN based methods demonstrate poor generalization ability to some extent due to the insufficiency of training data. On the other hand, patient-independent models attempt to utilize more patient data to train a universal model for all patients by pooling patient data together. Despite different techniques applied, results show that patient-independent models perform worse than patient-specific models due to high individual variation across patients. A substantial gap thus exists between patient-specific and patient-independent models. Approach. In this paper, we propose a novel training scheme based on knowledge distillation which makes use of a large amount of data from multiple subjects. It first distills informative features from signals of all available subjects with a pre-trained general model. A patient-specific model can then be obtained with the help of distilled knowledge and additional personalized data. Main results. Four state-of-the-art seizure prediction methods are trained on the Children's Hospital of Boston-MIT sEEG database with our proposed scheme. The resulting accuracy, sensitivity, and false prediction rate show that our proposed training scheme consistently improves the prediction performance of state-of-the-art methods by a large margin. Significance. The proposed training scheme significantly improves the performance of patient-specific seizure predictors and bridges the gap between patient-specific and patient-independent predictors.
翻译:目标:深神经网络(DNNs)在各种脑机器接口应用中表现出前所未有的成功,例如癫痫癫痫发作预测;然而,由于癫痫发作信号的高度个性化特点,现有方法通常以病人特有的模式培训模型;因此,每个学科的标签记录数量有限,因此只能用于培训;因此,目前基于DNNs的方法在某种程度上表明,由于培训数据不足,普遍化能力较差;另一方面,依赖病人的模型试图利用更多的病人数据,通过将病人的数据汇集在一起,为所有病人培训一个通用模型。尽管采用不同的技术,但结果显示,由于病人之间的个人差异很大,依赖病人的模型比针对病人的模型效果要差。因此,每个科目的贴标签记录数量有限。因此,根据知识的提法,我们提出了一个新的培训计划,利用了来自多个科目的大量数据。首先从所有现有科目的信号中提取了信息性特征,将病人的数据汇集在一起。之后,通过应用不同的技术,可以取得一个病人特有的模式,依靠病人的诊断性机能的模型比具体模型比病人特有的模型,由于病人的个人个人之间的个人差异预测率,从而大幅改进了我们所培训的预估测测测测测测测测的进度。