Across the world's coronavirus disease 2019 (COVID-19) hot spots, the need to streamline patient diagnosis and management has become more pressing than ever. As one of the main imaging tools, chest X-rays (CXRs) are common, fast, non-invasive, relatively cheap, and potentially bedside to monitor the progression of the disease. This paper describes the first public COVID-19 image data collection as well as a preliminary exploration of possible use cases for the data. This dataset currently contains hundreds of frontal view X-rays and is the largest public resource for COVID-19 image and prognostic data, making it a necessary resource to develop and evaluate tools to aid in the treatment of COVID-19. It was manually aggregated from publication figures as well as various web based repositories into a machine learning (ML) friendly format with accompanying dataloader code. We collected frontal and lateral view imagery and metadata such as the time since first symptoms, intensive care unit (ICU) status, survival status, intubation status, or hospital location. We present multiple possible use cases for the data such as predicting the need for the ICU, predicting patient survival, and understanding a patient's trajectory during treatment. Data can be accessed here: https://github.com/ieee8023/covid-chestxray-dataset
翻译:2019年(COVID-19)全球科罗纳病毒疾病(COVID-19)热点,简化患者诊断和管理的需要比以往更加迫切。作为主要成像工具之一,胸X光(CXRs)是常见的、快速的、非侵入的、相对廉价的,而且有可能是监测该疾病发展情况的主要成像工具之一。本文描述了首次公共COVID-19图像数据收集以及对数据可能使用案例的初步探索。该数据集目前包含数百张前视X光片,是COVID-19图像和预测性数据的最大公共资源,使得它成为开发和评价帮助治疗COVID-19的工具的必要资源。它由出版数字和各种基于网络的储存库手工汇总成机器学习(ML)友好格式,并附有数据载荷代码。我们收集了第一症状、强化护理单位状况、生存状态、内置状态或医院位置等时间的图像和元数据。我们在这里展示了多种可能的案例,用于对数据库/内存数据进行预测。我们在这里可以使用数据库/CUD数据库进行预测。