A recent breakthrough in differential network (DN) analysis of microbiome data has been realized with the advent of next-generation sequencing technologies. The DN analysis disentangles the microbial co-abundance among taxa by comparing the network properties between two or more graphs under different biological conditions. However, the existing methods to the DN analysis for microbiome data do not adjust for other clinical differences between subjects. We propose a Statistical Approach via Pseudo-value Information and Estimation for Differential Network Analysis (SOHPIE-DNA) that incorporates additional covariates such as continuous age and categorical BMI. SOHPIE-DNA is a regression technique adopting jackknife pseudo-values that can be implemented readily for the analysis. We demonstrate through simulations that SOHPIE-DNA consistently reaches higher recall and F1-score, while maintaining similar precision and accuracy to existing methods (NetCoMi and MDiNE). Lastly, we apply SOHPIE-DNA on two real datasets from the American Gut Project and the Diet Exchange Study to showcase the utility. The analysis of the Diet Exchange Study is to showcase that SOHPIE-DNA can also be used to incorporate the temporal change of connectivity of taxa with the inclusion of additional covariates. As a result, our method has found taxa that are related to the prevention of intestinal inflammation and severity of fatigue in advanced metastatic cancer patients.
翻译:微生物组数据的差异网络(DN)分析是一种通过比较两个或多个在不同生物条件下的图形的网络属性来分离生物系统结构的方法。然而,目前针对微生物组数据的差异网络分析方法没有考虑到其他临床性差异的影响。本文提出一种统计学方法,通过采用Jackknife伪值的估计和信息进行差异网络分析(SOHPIE-DNA),该方法可以轻松地在分析中将其他协变量(如连续的年龄和分类的BMI)纳入考虑。通过模拟实验,我们证明了SOHPIE-DNA在保持类似准确度和精度的情况下,能够始终达到更高的召回率和F1-score,且优于现有的方法(NetCoMi和MDiNE)。最后,我们应用SOHPIE-DNA方法对两个真实数据集(来自美国肠道计划和饮食交换研究)进行展示。饮食交换研究的分析结果表明,SOHPIE-DNA还可以用于纳入附加协变量的实时网络分析,以展示微生物共现网络变化的程度。因此,我们的方法发现了与防止肠炎和减轻晚期转移性癌症患者疲劳严重程度相关的微生物。