COVID 19 is an acute disease that started spreading throughout the world, beginning in December 2019. It has spread worldwide and has affected more than 7 million people, and 200 thousand people have died due to this infection as of Oct 2020. In this paper, we have forecasted the number of deaths and the confirmed cases in Los Angeles and New York of the United States using the traditional and Big Data platforms based on the Times Series: ARIMA and ETS. We also implemented a more sophisticated time-series forecast model using Facebook Prophet API. Furthermore, we developed the classification models: Logistic Regression and Random Forest regression to show that the Weather does not affect the number of the confirmed cases. The models are built and run in legacy systems (Azure ML Studio) and Big Data systems (Oracle Cloud and Databricks). Besides, we present the accuracy of the models.
翻译:COVID 19 是一种从2019年12月开始在全世界蔓延的急性疾病,从2019年12月开始,它在全世界蔓延,已经影响到700多万人,截至2020年10月,已有20万人因这种感染而死亡。在本文中,我们利用基于《时报系列:ARIMA和ETS》的传统和大数据平台预测了洛杉矶和美国纽约的死亡人数和确诊病例数。我们还使用Facebook先知API实施了更复杂的时间序列预测模型。此外,我们开发了分类模型:后勤倒退和随机森林回归,以表明天气并不影响已确认病例数。模型是在遗留系统(Azurre ML演播室)和大数据系统(Oracle Cloud and Databricks)中建立和运行的。此外,我们介绍了模型的准确性。