The rapid and accurate detection of COVID-19 cases is critical for timely treatment and preventing the spread of the disease. In this study, a two-stage feature extraction framework using eight state-of-the-art pre-trained deep Convolutional Neural Networks (CNNs) and an autoencoder is proposed to determine the health conditions of patients (COVID-19, Normal, Viral Pneumonia) based on chest X-rays. The X-ray scans are divided into four equally sized sections and analyzed by deep pre-trained CNNs. Subsequently, an autoencoder with three hidden layers is trained to extract reproductive features from the concatenated ouput of CNNs. To evaluate the performance of the proposed framework, three different classifiers, which are single-layer perceptron (SLP), multi-layer perceptron (MLP), and support vector machine (SVM) are used. Furthermore, the deep CNN architectures are used to create benchmark models and trained on the same dataset for comparision. The proposed framework outperforms other frameworks wih pre-trained feature extractors in binary classification and shows competitive results in three-class classification. The proposed methodology is task-independent and suitable for addressing various problems. The results show that the discriminative features are a subset of the reproductive features, suggesting that extracting task-independent features is superior to the extraction only task-based features. The flexibility and task-independence of the reproductive features make the conceptive information approach more favorable. The proposed methodology is novel and shows promising results for analyzing medical image data.
翻译:翻译后的摘要:
COVID-19病例的快速准确检测对及时治疗和预防疾病传播至关重要。本研究提出了一个使用8个最新的预训练深度卷积神经网络(CNNs)和一个自编码器的两阶段特征提取框架,用于根据胸部X光片确定患者的健康状况(COVID-19,正常,病毒性肺炎)。将X射线扫描分为四个等大小的部分,并由深度预训练CNN对其进行分析。随后,训练一个具有三个隐藏层的自编码器,从CNN的连接输出中提取生成性特征。为评估所提出的框架的性能,使用了三种不同的分类器,即单层感知器(SLP),多层感知器(MLP)和支持向量机(SVM)。此外,使用深度CNN架构创建基准模型,并使用相同的数据集进行训练进行比较。所提出的框架在二分类中优于使用预训练特征提取器的其他框架,并在三类分类中显示出竞争性结果。研究结果表明,鉴别性特征是生成性特征的一个子集,提示提取基于任务无关性特征比提取基于任务特定性特征更优。生成性特征的灵活性和任务无关性使得理解信息的方法更受欢迎。所提出的方法是全新的,对于分析医学图像数据显示出很有前途的结果。