Influenza is an infectious disease with the potential to become a pandemic, and hence, forecasting its prevalence is an important undertaking for planning an effective response. Research has found that web search activity can be used to improve influenza models. Neural networks (NN) can provide state-of-the-art forecasting accuracy but do not commonly incorporate uncertainty in their estimates, something essential for using them effectively during decision making. In this paper, we demonstrate how Bayesian Neural Networks (BNNs) can be used to both provide a forecast and a corresponding uncertainty without significant loss in forecasting accuracy compared to traditional NNs. Our method accounts for two sources of uncertainty: data and model uncertainty, arising due to measurement noise and model specification, respectively. Experiments are conducted using 14 years of data for England, assessing the model's accuracy over the last 4 flu seasons in this dataset. We evaluate the performance of different models including competitive baselines with conventional metrics as well as error functions that incorporate uncertainty estimates. Our empirical analysis indicates that considering both sources of uncertainty simultaneously is superior to considering either one separately. We also show that a BNN with recurrent layers that models both sources of uncertainty yields superior accuracy for these metrics for forecasting horizons greater than 7 days.
翻译:研究发现,网络搜索活动可以用来改进流感模型。神经网络(NN)可以提供最新预报准确性,但通常不在其估计数中包含不确定性,而这种不确定性对于在决策过程中有效使用这些不确定性至关重要。在本文中,我们演示如何利用巴耶西亚神经网络(BNN)提供预测和相应的不确定性,而不比传统NN的预测准确性大大降低。我们的方法说明两种不确定性来源:分别因测量噪音和模型规格而产生的数据和模型不确定性。实验使用英国14年的数据,评估模型在过去4个流感季节的准确性,评估模型在数据集中在过去4个流感季节的准确性。我们评估不同模型的性能,包括具有竞争性的基线和包含不确定性估计数的常规指标以及错误功能。我们的经验分析表明,两种不确定性来源同时考虑优于单独考虑两种来源。我们还表明,一个具有经常层的BNNN,两种来源的不确定性都比这些指标的精确性要高。我们还表明,两种来源的不确定性都比7天的精确度预测率更高。