The traditional approaches to false discovery rate (FDR) control in multiple hypothesis testing are usually based on the null distribution of a test statistic. However, all types of null distributions, including the theoretical, permutation-based and empirical ones, have some inherent drawbacks. For example, the theoretical null might fail because of improper assumptions on the sample distribution. Here, we propose a null distribution-free approach to FDR control for multiple hypothesis testing. This approach, named target-decoy procedure, simply builds on the ordering of tests by some statistic or score, the null distribution of which is not required to be known. Competitive decoy tests are constructed from permutations of original samples and are used to estimate the false target discoveries. We prove that this approach controls the FDR when the statistics are independent between different tests. Simulation demonstrates that it is more stable and powerful than two existing popular approaches. Evaluation is also made on a real dataset.
翻译:在多个假设测试中,对假发现率(FDR)控制的传统方法通常以测试统计数据的无效分布为基础,但所有类型的无效分布,包括理论、变异基础和经验分析,都有一些固有的缺点。例如,理论无效可能由于对抽样分布的不适当假设而失败。在这里,我们建议对多种假设测试采用一种无效的FDR控制无分配方法。这个方法称为目标标记程序,仅仅以某些统计或分数的测试顺序为基础,而这些统计或分数的无效分布则不必为人所知。竞争诱饵测试是从原始样品的变相中建立起来的,用来估计虚假的目标发现。我们证明,当统计数据在不同测试之间独立时,这种方法会控制FDR。模拟表明,它比现有的两种流行方法更稳定、更有力。评价还建立在真实数据集上。