COVID-19 pandemic has an unprecedented impact all over the world since early 2020. During this public health crisis, reliable forecasting of the disease becomes critical for resource allocation and administrative planning. The results from compartmental models such as SIR and SEIR are popularly referred by CDC and news media. With more and more COVID-19 data becoming available, we examine the following question: Can a direct data-driven approach without modeling the disease spreading dynamics outperform the well referred compartmental models and their variants? In this paper, we show the possibility. It is observed that as COVID-19 spreads at different speed and scale in different geographic regions, it is highly likely that similar progression patterns are shared among these regions within different time periods. This intuition lead us to develop a new neural forecasting model, called Attention Crossing Time Series (\textbf{ACTS}), that makes forecasts via comparing patterns across time series obtained from multiple regions. The attention mechanism originally developed for natural language processing can be leveraged and generalized to materialize this idea. Among 13 out of 18 testings including forecasting newly confirmed cases, hospitalizations and deaths, \textbf{ACTS} outperforms all the leading COVID-19 forecasters highlighted by CDC.
翻译:自2020年初以来,COVID-19大流行在全世界产生了前所未有的影响。在这场公共卫生危机期间,可靠的疾病预报成为资源分配和行政规划的关键。SIR和SEIR等不同模式的结果被CDC和新闻媒体广泛引用。随着越来越多的COVID-19数据被提供,我们研究以下问题:直接数据驱动的方法,而不将疾病传播的动态传播模式建模,能否超越周密分类模式及其变体?在本文件中,我们展示了这种可能性。据观察,由于COVID-19在不同地理区域以不同的速度和规模扩散,这些区域极有可能在不同的时间段内共享类似的进展模式。这种直觉引导我们开发了一个新的神经预测模式,称为注意跨时间系列(\ textbf{ACTS}),通过比较从多个区域获得的时间序列的格局进行预测。最初为自然语言处理开发的注意机制可以被利用和普及,从而实现这一理念。在18个测试中,有13个测试包括预测新确认的病例、住院和死亡,\ trextbfACTS}所有C的预测都突出了CVINS的预测。