SARS-CoV-2 is colloquially known as COVID-19 that had an initial outbreak in December 2019. The deadly virus has spread across the world, taking part in the global pandemic disease since March 2020. In addition, a recent variant of SARS-CoV-2 named Delta is intractably contagious and responsible for more than four million deaths over the world. Therefore, it is vital to possess a self-testing service of SARS-CoV-2 at home. In this study, we introduce Fruit-CoV, a two-stage vision framework, which is capable of detecting SARS-CoV-2 infections through recorded cough sounds. Specifically, we convert sounds into Log-Mel Spectrograms and use the EfficientNet-V2 network to extract its visual features in the first stage. In the second stage, we use 14 convolutional layers extracted from the large-scale Pretrained Audio Neural Networks for audio pattern recognition (PANNs) and the Wavegram-Log-Mel-CNN to aggregate feature representations of the Log-Mel Spectrograms. Finally, we use the combined features to train a binary classifier. In this study, we use a dataset provided by the AICovidVN 115M Challenge, which includes a total of 7371 recorded cough sounds collected throughout Vietnam, India, and Switzerland. Experimental results show that our proposed model achieves an AUC score of 92.8% and ranks the 1st place on the leaderboard of the AICovidVN Challenge. More importantly, our proposed framework can be integrated into a call center or a VoIP system to speed up detecting SARS-CoV-2 infections through online/recorded cough sounds.
翻译:SARS-COV-2是2019年12月首次爆发的COVID-19。致命病毒自2020年3月以来在全世界传播,自2020年3月以来参与全球流行病。此外,最近一个名为Delta的SARS-COV-2变异体具有不易传染性,造成全世界400多万人死亡。因此,必须在国内拥有SAS-COV-2的自我测试服务。在这项研究中,我们引入了水果-COV-19,这是一个两阶段的愿景框架,能够通过记录咳嗽声音检测SARS-COV-2感染。具体地说,我们将声音转换成Log-Mel Spectrogram,并使用高效的网络V2网络来提取第一阶段的视觉特征。在第二阶段,我们使用从大规模预先培训的音频网络中提取的14个革命层来识别音频模式。Wavegram-Log-Mel-CNN可以将我们的LOPROD-C-C-CFS-CFS-C-CRV-2级阵列的汇总地标图示图示。最后,我们利用了92级的系统综合功能,在SHIFISOVS-S-CSODRIVRIDSODRDRDA 上收集的系统上, 数据中, 提供了一个完整一个完整的预级的预测的预测变变变变变变变变变变变数据,在SODRDODODODODODODODODODODODODRDRDO。在SDODOD 上,在SDA 上,在SDRDRDR 上提供了一个AODODODODODA 上,在SDODODODA 上提供了一个A 上,在SDODODOD上,在SDODODOD上提供了上提供了上提供了一个A 上提供了上提供了一个CUDODODODODA 上,在SUDODODODODODODI 的完整一个总的计算上,在这项研究中,在这项研究中,在