In many practices, scientists are particularly interested in detecting which of the predictors are truly associated with a multivariate response. It is more accurate to model multiple responses as one vector rather than separating each component one by one. This is particularly true for complex traits having multiple correlated components. A Bayesian multivariate variable selection (BMVS) approach is proposed to select important predictors influencing the multivariate response from a candidate pool with an ultrahigh dimension. By applying the sample-size-dependent spike and slab priors, the BMVS approach satisfies the strong selection consistency property under certain conditions, which represents the advantages of BMVS over other existing Bayesian multivariate regression-based approaches. The proposed approach considers the covariance structure of multiple responses without assuming independence and integrates the estimation of covariance-related parameters together with all regression parameters into one framework through a fast updating MCMC procedure. It is demonstrated through simulations that the BMVS approach outperforms some other relevant frequentist and Bayesian approaches. The proposed BMVS approach possesses the flexibility of wide applications, including genome-wide association studies with multiple correlated phenotypes and a large scale of genetic variants and/or environmental variables, as demonstrated in the real data analyses section. The computer code and test data of the proposed method are available as an R package.
翻译:在许多做法中,科学家特别感兴趣的是检测哪些预测器真正与多变量反应有关;更准确的做法是将多重反应模型作为一个矢量,而不是将每个组成部分一个一个一个地分开;对于具有多个相关组成部分的复杂特征尤其如此;提议采用巴耶斯多变量选择方法,从一个具有超高尺寸的候选人才库中选择影响多变量反应的重要预测器;通过应用抽样大小的峰值和板块前缀,BMVS方法在某些条件下满足了很强的选择一致性特性,这代表了BMVS相对于其他现有的巴耶斯多变量回归法方法的优势;拟议方法考虑了多种反应的共变结构,而没有假设独立性,并且通过快速更新MCMC程序将共变参数和所有回归参数的估算纳入一个框架;通过模拟,表明BMVS方法超越了其他一些相关的频繁和巴耶斯方法。 拟议的BMVS方法具有广泛应用的灵活性,包括基因组联系研究,而不是现有的巴耶斯多变量,作为计算机模型和大规模测试模型模型,作为多种关联式的模型,作为已展示的模型和计算机式数据和大规模模型模型模型分析。