In many transcriptomic studies, the correlation of genes might fluctuate with quantitative factors such as genetic ancestry. We propose a method that models the covariance between two variables to vary against a continuous covariate. For the bivariate case, the proposed score test statistic is computationally simple and robust to model misspecification of the covariance term. Subsequently, the method is expanded to test relationships between one highly connected gene, such as a transcription factor, and several other genes for a more global investigation of the dynamic of the coexpression network. Simulations show that the proposed method has higher statistical power than alternatives, can be used in more diverse scenarios, and is computationally cheaper. We apply this method to African American subjects from GTEx to analyze the dynamic behavior of their gene coexpression against genetic ancestry and to identify transcription factors whose coexpression with their target genes change with the genetic ancestry. The proposed method can be applied to a wide array of problems that require covariance modeling.
翻译:在许多笔录组学研究中,基因的关联可能与遗传基因祖先等定量因素发生波动。 我们提出一种方法来模拟两种变量之间的共变情况,以便与连续的共变情况发生差异。 对于双变情况,提议的评分测试统计是计算简单和有力的,以模拟共变术语的偏差。 随后,该方法扩大以测试一个高度相连的基因之间的关系,如转录因子,以及其他一些基因之间的关系,以便对共同表达网络的动态进行更全面的调查。 模拟显示,拟议的方法具有比替代方法更高的统计能力,可以在更多样化的情况下使用,而且计算得更便宜。 我们把这种方法应用到来自GTEx的非裔美国人主体,以分析其基因与遗传祖先的基因的共变异的动态表现,并找出与基因遗传遗传遗传遗传遗传遗传基因变异的相。 拟议的方法可以适用于需要共变模型的范围广泛的问题。