After more than two years since the beginning of the COVID-19 pandemic, the pressure of this crisis continues to devastate globally. The use of chest X-ray (CXR) imaging as a complementary screening strategy to RT-PCR testing is not only prevailing but has greatly increased due to its routine clinical use for respiratory complaints. Thus far, many visual perception models have been proposed for COVID-19 screening based on CXR imaging. Nevertheless, the accuracy and the generalization capacity of these models are very much dependent on the diversity and the size of the dataset they were trained on. Motivated by this, we introduce COVIDx CXR-3, a large-scale benchmark dataset of CXR images for supporting COVID-19 computer vision research. COVIDx CXR-3 is composed of 30,386 CXR images from a multinational cohort of 17,026 patients from at least 51 countries, making it, to the best of our knowledge, the most extensive, most diverse COVID-19 CXR dataset in open access form. Here, we provide comprehensive details on the various aspects of the proposed dataset including patient demographics, imaging views, and infection types. The hope is that COVIDx CXR-3 can assist scientists in advancing computer vision research against the COVID-19 pandemic.
翻译:自COVID-19大流行开始两年多以来,这一危机的压力继续在全球消失。胸X光(CXR)成像作为RT-PCR测试的补充筛查战略的使用不仅普遍,而且由于例行临床用于呼吸系统投诉,已经大大增加了。到目前为止,已经为CXR成像的COVID-19筛查提出了许多视觉认知模型。然而,这些模型的准确性和普及能力在很大程度上取决于它们所培训的数据集的多样性和规模。我们为此引入了COVIDx CXR-3-3的大规模基准数据集,用于支持COVID-19计算机视觉研究。COVID CXR-3由来自至少51个国家的17,026名病人组成的多国组群的30,386 CXR图像组成。根据我们所知,这些模型的准确性和普及能力在很大程度上取决于它们所培训的数据集的多样性和规模。我们在这里提供了有关CVID图像的多个方面的全面细节,其中包括:CVVI的诊断性研究、CVI的希望。