Infectious disease forecasting is of great interest to the public health community and policymakers, since forecasts can provide insight into disease dynamics in the near future and inform interventions. Due to delays in case reporting, however, forecasting models may often underestimate the current and future disease burden. In this paper, we propose a general framework for addressing reporting delay in disease forecasting efforts with the goal of improving forecasts. We propose strategies for leveraging either historical data on case reporting or external internet-based data to estimate the amount of reporting error. We then describe several approaches for adapting general forecasting pipelines to account for under- or over-reporting of cases. We apply these methods to address reporting delay in data on dengue fever cases in Puerto Rico from 1990 to 2009 and to reports of influenza-like illness (ILI) in the United States between 2010 and 2019. Through a simulation study, we compare method performance and evaluate robustness to assumption violations. Our results show that forecasting accuracy and prediction coverage almost always increase when correction methods are implemented to address reporting delay. Some of these methods required knowledge about the reporting error or high quality external data, which may not always be available. Provided alternatives include excluding recently-reported data and performing sensitivity analysis. This work provides intuition and guidance for handling delay in disease case reporting and may serve as a useful resource to inform practical infectious disease forecasting efforts.
翻译:公共卫生界和决策者对传染病预测非常感兴趣,因为预测可以在最近的将来对疾病动态提供洞察力,并为干预措施提供信息。但是,由于报告案例方面的延误,预测模型往往会低估当前和未来的疾病负担。在本文件中,我们提议了一个总体框架,以解决疾病预测工作报告延迟的报告问题,目的是改进预测。我们提出了利用案例报告历史数据或外部互联网数据的战略,以估计报告错误的数量。我们接着描述了调整一般预测管道的若干方法,以考虑到病例报告不足或过多的情况。我们采用这些方法是为了解决1990年至2009年波多黎各登革热病例数据以及2010年至2019年美国类似流感疾病报告延迟的报告延迟的问题。我们通过模拟研究,比较方法性能并评价违反假设的稳健性。我们的结果显示,在采用纠正方法处理报告延误问题时,预测准确性和预测覆盖面几乎总是增加。有些方法需要了解报告错误或高质量的外部数据,而这些方法可能并非总能提供。我们采用的替代方法包括排除最近报告的数据和进行敏感度分析。这项工作可以提供直观和为疾病预测工作提供资源指导。