Pathogen deep-sequencing is an increasingly routinely used technology in infectious disease surveillance. We present a semi-parametric Bayesian Poisson model to exploit these emerging data for inferring infectious disease transmission flows and the sources of infection at the population level. The framework is computationally scalable in high dimensional flow spaces thanks to Hilbert Space Gaussian process approximations, allows for sampling bias adjustments, and estimation of gender- and age-specific transmission flows at finer resolution than previously possible. We apply the approach to densely sampled, population-based HIV deep-sequence data from Rakai, Uganda, and find substantive evidence that adolescent and young women are predominantly infected through age-disparate relationships.
翻译:病原体深层序列是传染病监测中日益常用的一种技术,我们提出了一个半参数贝叶斯-普瓦松模型,以利用这些新出现的数据来推断传染病的传播流量和人口层面的感染源;由于希尔伯特空间高森进程近似值,该框架可计算出在高维流动空间的可扩展性,从而可以进行抽样偏差调整,并比以往更精确地估计性别和年龄特定传播流。我们采用乌干达拉凯的密集抽样、基于人口的艾滋病毒深层序列数据,并找到大量证据,证明少女和年轻妇女主要通过年龄差异关系受感染。