The COVID-19 pandemic has emphasized the need for a robust understanding of epidemic models. Current models of epidemics are classified as either mechanistic or non-mechanistic: mechanistic models make explicit assumptions on the dynamics of disease, whereas non-mechanistic models make assumptions on the form of observed time series. Here, we introduce a simple mixture-based model which bridges the two approaches while retaining benefits of both. The model represents time series of cases and fatalities as a mixture of Gaussian curves, providing a flexible function class to learn from data compared to traditional mechanistic models. Although the model is non-mechanistic, we show that it arises as the natural outcome of a stochastic process based on a networked SIR framework. This allows learned parameters to take on a more meaningful interpretation compared to similar non-mechanistic models, and we validate the interpretations using auxiliary mobility data collected during the COVID-19 pandemic. We provide a simple learning algorithm to identify model parameters and establish theoretical results which show the model can be efficiently learned from data. Empirically, we find the model to have low prediction error. The model is available live at covidpredictions.mit.edu. Ultimately, this allows us to systematically understand the impacts of interventions on COVID-19, which is critical in developing data-driven solutions to controlling epidemics.
翻译:COVID-19流行病强调对流行病模式的正确理解的必要性。目前的流行病模式被归类为机械性或非机械性:机械性模型对疾病的动态作出明确的假设,而非机械性模型则对观察到的时间序列形式作出假设。这里,我们引入了一个简单的混合型模型,将两种方法联系起来,同时保留这两种方法的好处。模型代表了两种情况的时间序列和死亡,是高斯曲线的混合体,提供了一个灵活的功能类别,从数据中学习,与传统机械性模型相比较。虽然该模型不是机械性,但我们表明它是作为基于网络性SIR框架的随机过程的自然结果产生的。这样,与类似的非机械性模型相比,学习性模型可以进行更有意义的解释,我们用在COVID-19大流行病期间收集的辅助性流动数据来验证这些解释。我们提供了一种简单的学习算法,以确定模型参数和理论性结果,表明从数据中可以有效地学习模型。我们发现模型是低预测性的,但我们发现模型是低预测性错误的。这让学习的参数与类似的非机械性模型进行更有意义的解释,从而可以对最终影响进行控制。