Stochastic epidemic models provide an interpretable probabilistic description of the spread of a disease through a population. Yet, fitting these models to partially observed data is a notoriously difficult task due to intractability of the likelihood for many classical models. To remedy this issue, this article introduces a novel data-augmented MCMC algorithm for exact Bayesian inference under the stochastic SIR model, given only discretely observed counts of infection. In a Metropolis-Hastings step, the latent data are jointly proposed from a surrogate process carefully designed to closely resemble the SIR model, from which we can efficiently generate epidemics consistent with the observed data. This yields a method that explores the high-dimensional latent space efficiently, and scales to outbreaks with hundreds of thousands of individuals. We show that the Markov chain underlying the algorithm is uniformly ergodic, and validate its performance via thorough simulation experiments and a case study on the 2013-2015 outbreak of Ebola Haemorrhagic Fever in Western Africa.
翻译:然而,将这些模型与部分观测到的数据相匹配是一项众所周知的困难的任务,因为许多古典模型的可能性是难以理解的。为解决这一问题,本文章引入了一种新的数据强化MCMC算法,用于根据随机性SIR模型精确推断巴伊西亚人,仅以单独观察到的感染数为根据。在大都会-哈斯廷斯步骤中,潜在数据是由一个精心设计的代用程序联合提出的,该代用程序与SIR模型非常相似,从中我们可以有效地产生与所观察到的数据一致的流行病。这产生了一种方法,可以高效地探索高维潜在空间,并测量成千万人的爆发规模。我们表明,用于算法的马尔科夫链是统一的,通过彻底的模拟实验和关于2013-2015年西非埃博拉出血热爆发的案例研究来验证其性能。