The current COVID-19 pandemic is now getting contained, albeit at the cost of morethan2.3million human lives. A critical phase in any pandemic is the early detection of cases to develop preventive treatments and strategies. In the case of COVID-19,several studies have indicated that chest radiography images of the infected patients show characteristic abnormalities. However, at the onset of a given pandemic, such asCOVID-19, there may not be sufficient data for the affected cases to train models for their robust detection. Hence, supervised classification is ill-posed for this problem because the time spent in collecting large amounts of data from infected persons could lead to the loss of human lives and delays in preventive interventions. Therefore, we formulate the problem of identifying early cases in a pandemic as an anomaly detection problem, in which the data for healthy patients is abundantly available, whereas no training data is present for the class of interest (COVID-19 in our case). To solve this problem, we present several unsupervised deep learning approaches, including convolutional and adversarially trained autoencoder. We tested two settings on a publicly available dataset (COVIDx)by training the model on chest X-rays from (i) only healthy adults, and (ii) healthy and other non-COVID-19 pneumonia, and detected COVID-19 as an anomaly. Afterperforming3-fold cross validation, we obtain a ROC-AUC of0.765. These results are very encouraging and pave the way towards research for ensuring emergency preparedness in future pandemics, especially the ones that could be detected from chest X-rays
翻译:目前COVID-19大流行病正在得到遏制,尽管其代价是超过2,300万人的生命;任何大流行病的一个关键阶段是早期发现病例,以发展预防性治疗和战略;就COVID-19而言,若干研究表明,受感染病人的胸部射线图像显示有异常特征;然而,在某种流行病,如COVID-19爆发时,受影响病例可能没有足够的数据来训练能进行强有力检测的模型;因此,由于从受感染者那里收集大量数据所花费的时间可能导致生命损失和预防性干预的延误,因此,受监督的分类是不妥当的;因此,由于从受感染者那里收集大量数据所花费的时间可能会导致人类生命损失和预防性干预的延误,因此,我们把确定流行病早期病例的问题作为一个异常的检测问题,在这种疾病中,健康病人的数据是完全存在的,而没有提供培训的数据(COVID-19在我们的案件中),为了解决这一问题,我们提出了一些未经监督的深层次学习方法,包括快速和经过敌对训练的血液解剖。 我们在公开提供的数据(COD-65-VIxxx)中测试两种环境,特别是从RO-D-RO-RO-ROV3,从健康-Sudal3,从我们检测到健康-Siral-ral-rual3,从健康-rual-vial-vial3,我们只能测出一个健康-ral-ral-ral-exx3,从一个健康-ral-ex-ral-ral-ral-de-ex-3,我们检测到一个健康-de-ralismex-abal-rm-ral-ralisalisal-