In this paper, we propose a deep residual network-based method, namely the DiCOVA-Net, to identify COVID-19 infected patients based on the acoustic recording of their coughs. Since there are far more healthy people than infected patients, this classification problem faces the challenge of imbalanced data. To improve the model's ability to recognize minority class (the infected patients), we introduce data augmentation and cost-sensitive methods into our model. Besides, considering the particularity of this task, we deploy some fine-tuning techniques to adjust the pre-training ResNet50. Furthermore, to improve the model's generalizability, we use ensemble learning to integrate prediction results from multiple base classifiers generated using different random seeds. To evaluate the proposed DiCOVA-Net's performance, we conducted experiments with the DiCOVA challenge dataset. The results show that our method has achieved 85.43\% in AUC, among the top of all competing teams.
翻译:在本文中,我们提出一种基于网络的深残方法,即DiCOVA-Net,以根据对咳嗽的声学记录确定COVID-19受感染的病人。由于健康的人比受感染的病人多得多,这一分类问题面临着数据不平衡的挑战。为了提高模型识别少数群体(受感染的病人)的能力,我们将数据扩增和成本敏感的方法引入我们的模型。此外,考虑到这项任务的特殊性,我们运用一些微调技术来调整培训前的ResNet50。此外,为了改进模型的可概括性,我们利用共同学习将使用不同随机种子生成的多个基础分类器的预测结果综合起来。为了评估提议的DiCOVA-Net的性能,我们用DiCOVA挑战数据集进行了实验。结果显示,我们的方法在AUC中达到了85.43 ⁇,在所有相互竞争的团队中达到了85.43 ⁇ 。