This paper describes scalable methods for extracting and analyzing Covid-19 vaccine data. Using Big Data tools such as Hadoop and Hive, we collect and analyze a massive Covid-19 data set covering confirmed cases, fatalities, and vaccinations. The data size is about 3.2 GB. We show that it is possible to store and process data at this scale with Big Data tools. The paper carries out a spatio-temporal analysis and visualizes the results of the investigation with maps, charts, and pie charts. We illustrate that the more people are vaccinated, the fewer the confirmed cases.