Deep Learning (DL) have greatly contributed to bioelectric signals processing, in particular to extract physiological markers. However, the efficacy and applicability of the results proposed in the literature is often constrained to the population represented by the data used to train the models. In this study, we investigate the issues related to applying a DL model on heterogeneous datasets. In particular, by focusing on heart beat detection from Electrocardiogram signals (ECG), we show that the performance of a model trained on data from healthy subjects decreases when applied to patients with cardiac conditions and to signals collected with different devices. We then evaluate the use of Transfer Learning (TL) to adapt the model to the different datasets. In particular, we show that the classification performance is improved, even with datasets with a small sample size. These results suggest that a greater effort should be made towards generalizability of DL models applied on bioelectric signals, in particular by retrieving more representative datasets.
翻译:深层学习(DL)极大地促进了生物电信号处理,特别是提取生理标记;然而,文献中建议的结果的效力和适用性往往受用于培训模型的数据所代表的人口的限制。在本研究中,我们调查了在多样数据集中应用DL模型的相关问题。特别是,通过侧重于电心电图信号的心脏跳动检测,我们表明,在对健康对象数据进行训练的模型的性能在应用有心脏病症的病人和用不同装置收集的信号时会减少。然后,我们评估如何使用转移学习(TL)来使模型适应不同的数据集。特别是,我们表明,即使抽样规模小的数据集,分类性能也有所改善。这些结果表明,应更加努力争取生物电信号应用DL模型的普遍性,特别是重新寻找更具代表性的数据集。