The present work proposes a deep-learning-based approach for the classification of COVID-19 coughs from non-COVID-19 coughs and that can be used as a low-resource-based tool for early detection of the onset of such respiratory diseases. The proposed system uses the ResNet-50 architecture, a popularly known Convolutional Neural Network (CNN) for image recognition tasks, fed with the log-Mel spectrums of the audio data to discriminate between the two types of coughs. For the training and validation of the proposed deep learning model, this work utilizes the Track-1 dataset provided by the DiCOVA Challenge 2021 organizers. Additionally, to increase the number of COVID-positive samples and to enhance variability in the training data, it has also utilized a large open-source database of COVID-19 coughs collected by the EPFL CoughVid team. Our developed model has achieved an average validation AUC of 98.88%. Also, applying this model on the Blind Test Set released by the DiCOVA Challenge, the system has achieved a Test AUC of 75.91%, Test Specificity of 62.50%, and Test Sensitivity of 80.49%. Consequently, this submission has secured 16th position in the DiCOVA Challenge 2021 leader-board.
翻译:目前的工作提出了一种深层次的学习方法,用于对非COVID-19咳咳的COVID-19咳咳进行分类,这种方法可以作为一种低资源工具,用于早期发现这种呼吸道疾病的发病,拟议的系统使用ResNet-50结构,即众所周知的革命神经神经网络网络(CNN)来进行图像识别任务,并用声频数据的日志光谱来对两种类型的咳嗽进行区分。为培训和验证拟议的深层次学习模式,这项工作利用DiCOVA挑战2021组织者提供的轨道1数据集。此外,为了增加COVID阳性样本的数量,并增强培训数据的变异性,该系统还利用了EPFL CoughVid团队收集的COVID-19咳的大型开放源数据库。我们开发的模型实现了平均校验ACUC98.88%。此外,在DiCOVA挑战中发布的盲人测试测试测试成套模型已经达到75.91%,测试规格为62.50%,测试规格为16.50%,测试前导点为20VA 。