Hypothesis testing of structure in correlation and covariance matrices is of broad interest in many application areas. In high dimensions and/or small to moderate sample sizes, high error rates in testing is a substantial concern. This article focuses on increasing power through a frequentist assisted by Bayes (FAB) procedure. This FAB approach boosts power by including prior information on the correlation parameters. In particular, we suppose there is one of two sources of prior information: (i) a prior dataset that is distinct from the current data but related enough that it may contain valuable information about the correlation structure in the current data; and (ii) knowledge about a tendency for the correlations in different parameters to be similar so that it is appropriate to consider a hierarchical model. When the prior information is relevant, the proposed FAB approach can have significant gains in power. A divide-and-conquer algorithm is developed to reduce computational complexity in massive testing dimensions. We show improvements in power for detecting correlated gene pairs in genomic studies while maintaining control of Type I error or false discover rate (FDR).
翻译:对相关性和共变矩阵的结构进行假设性测试在许多应用领域引起了广泛的兴趣。在高尺寸和(或)小到中度样本规模中,测试中的高误率是一个重大关切问题。本条款侧重于通过贝耶斯(Bayes)程序协助的常客程序来增加权力。FAB方法通过事先列入有关相关参数的信息来增强权力。特别是,我们认为,先前信息的两个来源之一:(一) 先前的数据集不同于当前数据,但相关程度足以包含关于当前数据中相关结构的宝贵信息;以及(二) 了解不同参数中的相关性趋势,以便考虑一个等级模式是适当的。在以前的信息相关时,拟议的FAB方法可以取得显著的实力收益。正在开发一种分化算算算法,以减少大规模测试的计算复杂性。我们在基因研究中发现相关基因配对的能力有所改进,同时保持对类型I错误或错误发现率的控制(FDR)。