In this work, we study the pandemic course in the United States by considering national and state levels data. We propose and compare multiple time-series prediction techniques which incorporate auxiliary variables. One type of approach is based on spatio-temporal graph neural networks which forecast the pandemic course by utilizing a hybrid deep learning architecture and human mobility data. Nodes in this graph represent the state-level deaths due to COVID-19, edges represent the human mobility trend and temporal edges correspond to node attributes across time. The second approach is based on a statistical technique for COVID-19 mortality prediction in the United States that uses the SARIMA model and eXogenous variables. We evaluate these techniques on both state and national levels COVID-19 data in the United States and claim that the SARIMA and MCP models generated forecast values by the eXogenous variables can enrich the underlying model to capture complexity in respectively national and state levels data. We demonstrate significant enhancement in the forecasting accuracy for a COVID-19 dataset, with a maximum improvement in forecasting accuracy by 64.58% and 59.18% (on average) over the GCN-LSTM model in the national level data, and 58.79% and 52.40% (on average) over the GCN-LSTM model in the state level data. Additionally, our proposed model outperforms a parallel study (AUG-NN) by 27.35% improvement of accuracy on average.
翻译:在这项工作中,我们通过考虑国家和州一级数据,研究美国的大流行病课程;我们提出并比较包含辅助变量的多重时间序列预测技术;一种方法是基于spatio-时表神经网络,利用混合深层学习结构和人类流动数据,预测该流行病课程;本图中的节点代表了由于COVID-19造成的州一级死亡,边缘代表了人类流动趋势和时间边缘与时间的节点特征相对应;第二种方法是基于使用SARIMA模型和eXgenous变量的美国COVID-19死亡率预测的统计技术;一种是基于美国州和国家两级COVID-19型神经神经网络,利用混合深层学习结构和人类流动数据进行预测。本图中的SARIMA和MCP模型能够丰富基本模型,分别反映国家和州一级数据的复杂程度。我们显示COVID-19数据集的预测准确性显著提高,在GCN-LS-19模型中,比G-LS-LSM平均数据水平提高了64.58%和59.18%(平均)。