Use of next-generation sequencing technologies to transcriptomics (RNA-seq) for gene expression profiling has found widespread application in studying different biological conditions including cancers. However, RNA-seq experiments are still small sample size experiments due to the cost. Recently, an increased focus has been on meta-analysis methods for integrated differential expression analysis for exploration of potential biomarkers. In this study, we propose a p-value combination method for meta-analysis of multiple related RNA-seq studies that accounts for sample size of a study and direction of expression of genes in individual studies. The proposed method generalizes the inverse-normal method without increase in computational complexity and does not pre- or post-hoc filter genes that have conflicting direction of expression in different studies. Thus, the proposed method, as compared to the inverse-normal, has better potential for the discovery of differentially expressed genes (DEGs) with potentially conflicting differential signals from multiple studies related to disease. We demonstrated the use of the proposed method in detection of biologically relevant DEGs in glioblastoma (GBM), the most aggressive brain cancer. Our approach notably enabled the identification of over-expression in GBM compared to healthy controls of the oncogene RAD51, which has recently been shown to be a target for inhibition to enhance radiosensitivity of GBM cells during treatment. Pathway analysis identified multiple aberrant GBM related pathways as well as novel regulators such as TCF7L2 and MAPT as important upstream regulators in GBM. The proposed method provides a way to establish differential expression status for genes with conflicting direction of expression in individual RNA-seq studies. Hence, leading to further exploration of them as potential biomarkers for the disease.
翻译:使用下一代测序技术进行基因表达式剖析(RNA-seq)的基因分析发现,在研究包括癌症在内的不同生物条件时,广泛应用了下一代测序技术;然而,RNA-seq实验由于成本原因,其样本规模仍然很小;最近,越来越重视为勘探潜在生物标志而采用综合差异表达分析的元分析方法;在本研究中,我们提出了一个对多种相关的RNA-sequal研究进行元分析的数值组合方法,该方法将研究的样本规模和基因在个人研究中的表达方向考虑在内。 拟议的方法将反常方法普遍化,而没有增加计算复杂性,而且没有在不同研究中出现相互矛盾的预选或后热液过滤基因。因此,与反常相比,拟议的方法更有可能发现不同表达的基因表达(DEGEG),其与疾病有关的不同信号可能相互冲突。 我们证明,拟议方法用于检测在血压细胞中与生物相关的DEG(GBM)的表达方式,这是最具有侵略性的脑癌的重要表现方式,因此,在对G-BM(G)的诊断过程中可以进一步识别与G-roal-roal-roal-mama)进行结果分析。