关于根据12先导ECG进行综合地物工程和深学习以进行诊断、风险预防以及年龄估计的合并地物工程和深学习 (On Merging Feature Engineering and Deep Learning for Diagnosis, Risk-Prediction and Age Estimation Based on the 12-Lead ECG)

Objective: Machine learning techniques have been used extensively for 12-lead electrocardiogram (ECG) analysis. For physiological time series, deep learning (DL) superiority to feature engineering (FE) approaches based on domain knowledge is still an open question. Moreover, it remains unclear whether combining DL with FE may improve performance. Methods: We considered three tasks intending to address these research gaps: cardiac arrhythmia diagnosis (multiclass-multilabel classification), atrial fibrillation risk prediction (binary classification), and age estimation (regression). We used an overall dataset of 2.3M 12-lead ECG recordings to train the following models for each task: i) a random forest taking the FE as input was trained as a classical machine learning approach; ii) an end-to-end DL model; and iii) a merged model of FE+DL. Results: FE yielded comparable results to DL while necessitating significantly less data for the two classification tasks and it was outperformed by DL for the regression task. For all tasks, merging FE with DL did not improve performance over DL alone. Conclusion: We found that for traditional 12-lead ECG based diagnosis tasks DL did not yield a meaningful improvement over FE, while it improved significantly the nontraditional regression task. We also found that combining FE with DL did not improve over DL alone which suggests that the FE were redundant with the features learned by DL. Significance: Our findings provides important recommendations on what machine learning strategy and data regime to chose with respect to the task at hand for the development of new machine learning models based on the 12-lead ECG.

翻译：目标:机器学习技术已被广泛用于12级领先心电图(ECG)分析。对于生理时间序列而言,基于域知识的深度学习(DL)优于地貌工程(FE)方法仍是一个尚未解决的问题。此外,尚不清楚将DL与FE相结合能否提高绩效。方法:我们考虑了旨在解决这些研究差距的三项任务:心脏失常诊断(多等多标签分类分类),工时纤维化风险预测(二等分类)和年龄估计(回归)。我们使用了一个由2.3M 12级领先ECG录音组成的总体数据集来为每项任务培训以下模式:一是随机森林,将FE作为投入作为经典机器学习方法加以培训;二是端至端DL模型;三是FE+DL的合并模型。结果:FE得出了可比较的结果,而两项分类任务则需要大大减少数据,而DL值则比DL的回归任务要高得多。对于所有任务而言,将FE和DL合并的功能与DL没有改善业绩,而DL本身没有将FL作为重要的分析结果。我们发现,而传统的FL系统也明显地改进了。我们发现,而没有改进了FL在12级的改进了。