Multiple testing with false discovery rate (FDR) control has been widely conducted in the ``discrete paradigm" where p-values have discrete and heterogeneous null distributions. However, in this scenario existing FDR procedures often lose some power and may yield unreliable inference, and for this scenario there does not seem to be an FDR procedure that partitions hypotheses into groups, employs data-adaptive weights and is non-asymptotically conservative. We propose a weighted FDR procedure for multiple testing in the discrete paradigm that efficiently adapts to both the heterogeneity and discreteness of p-value distributions. We theoretically justify the non-asymptotic conservativeness of the weighted FDR procedure under independence, and show via simulation studies that, for multiple testing based on p-values of Binomial test or Fisher's exact test, it is more powerful than six other procedures. The weighted FDR procedure is applied to a drug safety study and a differential methylation study based on discrete data, where it makes more discoveries than two existing methods.
翻译:假发现率(FDR)控制在“分解范式”中广泛进行了多重测试,p值具有离散性和异差性,但是,在这种假设中,现有的FDR程序往往失去一些力量,可能产生不可靠的推论,对于这种假设,似乎没有一种FDR程序将假设分成组,使用数据适应性重量,并且非随机保守。我们提议在离散范式中采用加权FDR程序进行多重测试,以有效适应p-价值分布的异异性和异差性。我们理论上证明,独立情况下加权FDR程序的非非被动保守性是合理的,并通过模拟研究表明,对于基于Binomial试验或Fishercher精确试验的p值的多重测试,它比其他六种程序更强大。加权FDR程序适用于一项药物安全研究和基于离散数据的差异甲基化研究,在其中发现多于两种现有方法。