Recent work has reported that AI classifiers trained on audio recordings can accurately predict severe acute respiratory syndrome coronavirus 2 (SARSCoV2) infection status. Here, we undertake a large scale study of audio-based deep learning classifiers, as part of the UK governments pandemic response. We collect and analyse a dataset of audio recordings from 67,842 individuals with linked metadata, including reverse transcription polymerase chain reaction (PCR) test outcomes, of whom 23,514 tested positive for SARS CoV 2. Subjects were recruited via the UK governments National Health Service Test-and-Trace programme and the REal-time Assessment of Community Transmission (REACT) randomised surveillance survey. In an unadjusted analysis of our dataset AI classifiers predict SARS-CoV-2 infection status with high accuracy (Receiver Operating Characteristic Area Under the Curve (ROCAUC) 0.846 [0.838, 0.854]) consistent with the findings of previous studies. However, after matching on measured confounders, such as age, gender, and self reported symptoms, our classifiers performance is much weaker (ROC-AUC 0.619 [0.594, 0.644]). Upon quantifying the utility of audio based classifiers in practical settings, we find them to be outperformed by simple predictive scores based on user reported symptoms.
翻译:最近的工作报告称,接受过声音录音培训的AI分类人员可以准确预测严重急性呼吸道综合症冠状病毒2 (SARSCOV2)感染状况。我们在这里对基于声音的深学习分类人员进行大规模研究,作为英国政府大流行病对策的一部分。我们收集和分析了67,842名具有相关元数据的个人的录音数据集,包括反转转聚合酶链反应(PCR)测试结果,其中23,514人通过SARS CoV2.测试测试测试结果呈阳性,通过联合王国政府国家保健服务测试和跟踪方案以及社区传播实时评估随机监测调查,在对数据集的AI分类人员进行未经调整的分析中,根据以往研究结果,以高精确度预测SARS-COV-2感染状况(RCAUC下接收者操作特征区)0.846 [0.838,0.854]。然而,在对测量的聚合者进行匹配后,例如年龄、性别和自报症状,我们的分类工作表现非常差(ROC-AUC 0.694,根据简单用户的预测,根据我们使用的等级,以0.6-0.6的等级,报告根据简单的用户的等级,根据我们的标准,从标准,从0.69的分类,从10到分数级的等级,从10到分级。)。</s>