Understanding factors that contribute to the increased likelihood of disease transmission between two individuals is important for infection control. However, analyzing measures of genetic relatedness is complicated due to correlation arising from the presence of the same individual across multiple dyadic outcomes, potential spatial correlation caused by unmeasured transmission dynamics, and the distinctive distributional characteristics of some of the outcomes. We develop two novel hierarchical Bayesian spatial methods for analyzing dyadic genetic relatedness data, in the form of patristic distances and transmission probabilities, that simultaneously address each of these complications. Using individual-level spatially correlated random effect parameters, we account for multiple sources of correlation between the outcomes as well as other important features of their distribution. Through simulation, we show the limitations of existing approaches in terms of estimating key associations of interest, and the ability of the new methodology to correct for these issues across datasets with different levels of correlation. All methods are applied to Mycobacterium tuberculosis data from the Republic of Moldova where we identify previously unknown factors associated with disease transmission and, through analysis of the random effect parameters, key individuals and areas with increased transmission activity. Model comparisons show the importance of the new methodology in this setting. The methods are implemented in the R package GenePair.
翻译:然而,分析遗传关联性的方法很复杂,因为同一个人在多个三角结果中的存在、未测量的传导动态造成的潜在空间关联以及某些结果的独特的分布特点,我们开发了两种新型的波音等级空间方法,用以分析dyadic遗传关联性数据,其形式为父系距离和传导概率,同时处理这些并发症的每一种。我们使用个人层次的空间相关随机效应参数,说明结果与其他重要分布特征之间的相关性的多种来源。我们通过模拟,显示了现有方法在估计关键利益联系方面的局限性,以及新方法在涉及不同程度关联的数据集中纠正这些问题的能力。所有方法都用于摩尔多瓦共和国的Mycocactirium结核病数据,我们在那里查明了与疾病传播有关的先前未知的因素,并通过随机效应参数分析,查明了传播活动增加的关键个人和领域。模型比较显示了新方法在设定中的重要性。