We propose a novel approach that integrates machine learning into compartmental disease modeling to predict the progression of COVID-19. Our model is explainable by design as it explicitly shows how different compartments evolve and it uses interpretable encoders to incorporate covariates and improve performance. Explainability is valuable to ensure that the model's forecasts are credible to epidemiologists and to instill confidence in end-users such as policy makers and healthcare institutions. Our model can be applied at different geographic resolutions, and here we demonstrate it for states and counties in the United States. We show that our model provides more accurate forecasts, in metrics averaged across the entire US, than state-of-the-art alternatives, and that it provides qualitatively meaningful explanatory insights. Lastly, we analyze the performance of our model for different subgroups based on the subgroup distributions within the counties.
翻译:我们提出一种新的方法,将机器学习纳入部门疾病模型,以预测COVID-19的进展。我们的模式可以通过设计来解释,因为它清楚地表明了不同区块的演变方式,并且使用了可解释的编码器来纳入共变和绩效。解释对于确保模型的预测对流行病学家可信,并且对决策者和保健机构等终端用户产生信心是有价值的。我们的模型可以应用于不同的地理分辨率,我们在这里为美国各州和各州演示。我们显示,我们的模型提供了比最新替代方法更准确的预测,以全美平均指标衡量,而不是最新指标,它提供了质量上有意义的解释性洞察力。最后,我们根据各州内的分组分布分析了我们不同分组模式的绩效。