Stochastic epidemic models (SEMs) fit to incidence data are critical to elucidating outbreak dynamics, shaping response strategies, and preparing for future epidemics. SEMs typically represent counts of individuals in discrete infection states using Markov jump processes (MJPs), but are computationally challenging as imperfect surveillance, lack of subject-level information, and temporal coarseness of the data obscure the true epidemic. Analytic integration over the latent epidemic process is impossible, and integration via Markov chain Monte Carlo (MCMC) is cumbersome due to the dimensionality and discreteness of the latent state space. Simulation-based computational approaches can address the intractability of the MJP likelihood, but are numerically fragile and prohibitively expensive for complex models. A linear noise approximation (LNA) that approximates the MJP transition density with a Gaussian density has been explored for analyzing prevalence data in large-population settings, but requires modification for analyzing incidence counts without assuming that the data are normally distributed. We demonstrate how to reparameterize SEMs to appropriately analyze incidence data, and fold the LNA into a data augmentation MCMC framework that outperforms deterministic methods, statistically, and simulation-based methods, computationally. Our framework is computationally robust when the model dynamics are complex and applies to a broad class of SEMs. We evaluate our method in simulations that reflect Ebola, influenza, and SARS-CoV-2 dynamics, and apply our method to national surveillance counts from the 2013--2015 West Africa Ebola outbreak.
翻译:适合发病率数据的沙粒流行病模型(SEM)对于澄清爆发爆发动态、制定应对策略和为未来流行病做准备至关重要。基于模拟的计算方法通常代表使用Markov跳跃过程(MJPs)的离散感染国个人数量,但计算上具有挑战性,因为监测不完善,缺乏主题信息,数据的时间粗糙掩盖了真正的流行病。在潜在流行病进程中不可能进行分析整合,而通过Markov链 Monte Carlo(MCMC)的整合由于潜伏状态空间的维度和离散性而十分繁琐。基于模拟的计算方法可以解决MJP可能性的可选性,但对于复杂模型而言,其数字脆弱和过于昂贵。 已经探索了线性噪音近似MJP过渡密度,高斯密度掩盖了真正的流行病数据,但在不假定数据通常以我们的数据为基础的情况下,则需要修改发生率计算。我们展示了如何将SEM重新测量成适当分析发生率数据的方法,并将LNA-2的动态模型化为数据增强型的模型,在SARMS-MS的模型中,我们统计-CMS-C的模型计算方法是超越了我们的统计-CRireal-C-mo化方法。