水果-COV:一个高效的基于愿景的框架,通过记录咳声快速检测和诊断SARS-COV-2感染 (Fruit-CoV: An Efficient Vision-based Framework for Speedy Detection and Diagnosis of SARS-CoV-2 Infections Through Recorded Cough Sounds)

SARS-CoV-2 is colloquially known as COVID-19 that had an initial outbreak in December 2019. The deadly virus has spread across the world, taking part in the global pandemic disease since March 2020. In addition, a recent variant of SARS-CoV-2 named Delta is intractably contagious and responsible for more than four million deaths over the world. Therefore, it is vital to possess a self-testing service of SARS-CoV-2 at home. In this study, we introduce Fruit-CoV, a two-stage vision framework, which is capable of detecting SARS-CoV-2 infections through recorded cough sounds. Specifically, we convert sounds into Log-Mel Spectrograms and use the EfficientNet-V2 network to extract its visual features in the first stage. In the second stage, we use 14 convolutional layers extracted from the large-scale Pretrained Audio Neural Networks for audio pattern recognition (PANNs) and the Wavegram-Log-Mel-CNN to aggregate feature representations of the Log-Mel Spectrograms. Finally, we use the combined features to train a binary classifier. In this study, we use a dataset provided by the AICovidVN 115M Challenge, which includes a total of 7371 recorded cough sounds collected throughout Vietnam, India, and Switzerland. Experimental results show that our proposed model achieves an AUC score of 92.8% and ranks the 1st place on the leaderboard of the AICovidVN Challenge. More importantly, our proposed framework can be integrated into a call center or a VoIP system to speed up detecting SARS-CoV-2 infections through online/recorded cough sounds.

翻译：SARS-COV-2是2019年12月首次爆发的COVID-19。致命病毒自2020年3月以来在全世界传播,自2020年3月以来参与全球流行病。此外,最近一个名为Delta的SARS-COV-2变异体具有不易传染性,造成全世界400多万人死亡。因此,必须在国内拥有SAS-COV-2的自我测试服务。在这项研究中,我们引入了水果-COV-19,这是一个两阶段的愿景框架,能够通过记录咳嗽声音检测SARS-COV-2感染。具体地说,我们将声音转换成Log-Mel Spectrogram,并使用高效的网络V2网络来提取第一阶段的视觉特征。在第二阶段,我们使用从大规模预先培训的音频网络中提取的14个革命层来识别音频模式。Wavegram-Log-Mel-CNN可以将我们的LOPROD-C-C-CFS-CFS-C-CRV-2级阵列的汇总地标图示图示。最后,我们利用了92级的系统综合功能,在SHIFISOVS-S-CSODRIVRIDSODRDRDA 上收集的系统上, 数据中, 提供了一个完整一个完整的预级的预测的预测变变变变变变变变变变变数据,在SODRDODODODODODODODODODODODODRDRDO。在SDODOD 上,在SDA 上,在SDRDRDR 上提供了一个AODODODODODA 上,在SDODODODA 上提供了一个A 上,在SDODODOD上,在SDODODOD上提供了上提供了上提供了一个A 上提供了上提供了一个CUDODODODODA 上,在SUDODODODODODODI 的完整一个总的计算上,在这项研究中,在这项研究中,在

相关内容

Networking

关注 22

Networking：IFIP International Conferences on Networking。 Explanation：国际网络会议。 Publisher：IFIP。 SIT： http://dblp.uni-trier.de/db/conf/networking/index.html

【ICCV 2021 】Vision Transformer中的相对位置编码

专知会员服务

30+阅读 · 2021年7月30日

【百度】 PP-YOLOv2使用目标检测器

专知会员服务

18+阅读 · 2021年4月24日

【文献综述】Text Detection and Recognition in the Wild: A Review 自然文本检测与识别

专知会员服务

46+阅读 · 2020年6月11日

50+篇《神经架构搜索NAS》2020论文合集

专知会员服务

61+阅读 · 2020年3月19日