The U.S. COVID-19 Forecast Hub aggregates forecasts of the short-term burden of COVID-19 in the United States from many contributing teams. We study methods for building an ensemble that combines forecasts from these teams. These experiments have informed the ensemble methods used by the Hub. To be most useful to policy makers, ensemble forecasts must have stable performance in the presence of two key characteristics of the component forecasts: (1) occasional misalignment with the reported data, and (2) instability in the relative performance of component forecasters over time. Our results indicate that in the presence of these challenges, an untrained and robust approach to ensembling using an equally weighted median of all component forecasts is a good choice to support public health decision makers. In settings where some contributing forecasters have a stable record of good performance, trained ensembles that give those forecasters higher weight can also be helpful.
翻译:美国COVID-19预报枢纽综合了许多派遣小组对美国COVID-19短期负担的预测。我们研究建立组合的方法,将这些小组的预测综合在一起。这些实验为中心使用的组合方法提供了参考。为了对决策者最有用,组合预测必须具有稳定的性能,因为组成部分预测有两个关键特征:(1) 偶尔与所报告的数据不协调,(2) 组成部分预测人员在一段时间内的相对性能不稳定。我们的结果表明,在面临这些挑战时,使用所有组成部分预测的同等加权中位数进行组合的不训练有素和稳健的方法是支持公共卫生决策者的好选择。 在一些提供预测者业绩良好的记录稳定的情况下,经过培训的组合可以提高预测者的重量。