In Bayesian Network Regression models, networks are considered the predictors of continuous responses. These models have been successfully used in brain research to identify regions in the brain that are associated with specific human traits, yet their potential to elucidate microbial drivers in biological phenotypes for microbiome research remains unknown. In particular, microbial networks are challenging due to their high-dimension and high sparsity compared to brain networks. Furthermore, unlike in brain connectome research, in microbiome research, it is usually expected that the presence of microbes have an effect on the response (main effects), not just the interactions. Here, we develop the first thorough investigation of whether Bayesian Network Regression models are suitable for microbial datasets on a variety of synthetic data that was generated under realistic biological scenarios. We test whether the Bayesian Network Regression model that accounts only for interaction effects (edges in the network) is able to identify key drivers in phenotypic variability (microbes). We show that this model is indeed able to identify influential nodes and edges in the microbial networks that drive changes in the phenotype for most biological settings, but we also identify scenarios where this method performs poorly which allows us to provide practical advice for domain scientists aiming to apply these tools to their datasets. Finally, we implement the model in a publicly available Julia package at https://github.com/solislemuslab/BayesianNetworkRegression.jl.
翻译:在Bayesian网络回归模型中,网络被认为是持续反应的预测因素。这些模型被成功地用于大脑研究,以确认大脑中与特定人类特性相关的区域,但是它们为微生物研究而在生物苯型研究中解释微生物驱动因素的潜力仍然未知。特别是,微生物网络因其高度分散和与大脑网络相比的偏狭性而具有挑战性。此外,与大脑连接研究不同,在微生物研究中,通常预期微生物的存在会对反应(主要影响)产生影响,而不仅仅是相互作用。在这里,我们开发了第一次彻底调查,了解Bayesian网络回归模型是否适合用于在现实生物假设下产生的各种合成数据中的微生物数据集。我们测试了Bayesian网络回归模型是否因其高度分散性和与大脑网络相比具有挑战性。此外,与大脑连接性研究不同,通常预计微生物网络的存在对反应(主要驱动因素)产生影响(主要影响),而不仅仅是相互作用。我们发现这个模型确实能够识别微生物网络中的有影响的节点和边缘。我们开发了Byesian网络的模型,推动在现实的模型中,我们将这些模型应用了最差的模型来测量模型,我们最终将这些模型用于生物结构的模型。