As the coronavirus disease 2019 (COVID-19) becomes a global pandemic, policy makers must enact interventions to stop its spread. Data driven approaches might supply information to support the implementation of mitigation and suppression strategies. To facilitate research in this direction, we present a machine-readable dataset that aggregates relevant data from governmental, journalistic, and academic sources on the county level. In addition to county-level time-series data from the JHU CSSE COVID-19 Dashboard, our dataset contains more than 300 variables that summarize population estimates, demographics, ethnicity, housing, education, employment and in come, climate, transit scores, and healthcare system-related metrics. Furthermore, we present aggregated out-of-home activity information for various points of interest for each county, including grocery stores and hospitals, summarizing data from SafeGraph. By collecting these data, as well as providing tools to read them, we hope to aid researchers investigating how the disease spreads and which communities are best able to accommodate stay-at-home mitigation efforts. Our dataset and associated code are available at https://github.com/JieYingWu/COVID-19_US_County-level_Summaries.
翻译:随着2019年科罗纳病毒疾病(COVID-19)成为全球流行病,决策者必须制定干预措施以阻止其传播,以数据驱动的方法可以提供信息,支持执行减缓和抑制战略;为便利这方面的研究,我们提供一台机器可读数据集,汇总来自政府、新闻和州一级的学术来源的相关数据;除了从JHU COVID-19 Dashboard(JHU CHSE COVID-19 Dashboard)获得的州级时间序列数据外,我们的数据集包含300多个变量,这些变量总结了人口估计、人口统计、族裔、住房、教育、就业以及未来、气候、过境分数和与保健系统有关的计量。此外,我们还为每个州的不同利益点,包括杂货店和医院,提供了汇总SafeGraph数据的综合家庭外活动信息。通过收集这些数据和提供工具阅读这些数据,我们希望帮助研究人员调查疾病传播情况以及哪些社区最能适应住在家里的缓解工作。我们的数据集和相关代码可在https://github.com/Jie-YWu/COVI/COVI/提供工具。