The human microbiome can contribute to pathogeneses of many complex diseases by mediating disease-leading causal pathways. However, standard mediation analysis methods are not adequate to analyze the microbiome as a mediator due to the excessive number of zero-valued sequencing reads in the data. The two main challenges raised by the zero-inflated data structure are: (a) disentangling the mediation effect induced by the point mass at zero; and (b) identifying the observed zero-valued data points that are actually not zero (i.e., false zeros). We develop a novel mediation analysis method under the potential-outcomes framework to fill this gap. We show that the mediation effect of the microbiome can be decomposed into two components that are inherent to the two-part nature of zero-inflated distributions. With probabilistic models to account for observing zeros, we also address the challenge with false zeros. A comprehensive simulation study and the applications in two real microbiome studies demonstrate that our approach outperforms existing mediation analysis approaches.
翻译:人类微生物可以通过在疾病导致的因果关系路径中进行调解,促进许多复杂疾病的病原体。然而,标准的调解分析方法不足以分析微生物作为调解者的作用,因为数据中显示的零价值排序数量过多。零膨胀的数据结构提出的两个主要挑战有:(a) 分解零质量点引发的调解效应;(b) 确定观察到的零价值数据点,但实际上不是零(即虚零)。我们在潜在结果框架内开发了一种新的调解分析方法,以填补这一空白。我们表明,微生物的调解效果可以分解成两个组成部分,这两个组成部分是零膨胀分布的两部分所固有的。在考虑观察零的概率模型的情况下,我们还用假零来应对挑战。全面模拟研究以及两项真正的微生物研究的应用表明,我们的方法超过了现有的调解分析方法。