Consider a multiple hypothesis testing setting in which only a small proportion of the measured features contain non-null effects. Under typical conditions, the log of the P-value associated with each feature is approximately a sparse mixture of chi-squared distributions, one of which is scaled and non-central. We characterize the asymptotic performance of global tests combining these P-values in terms of the chisquared mixture parameters: the scaling parameters controlling heteroscedasticity, the non-centrality parameter describing the effect size, and the parameter controlling the rarity of the non-null features. Specifically, in a phase space involving the last two parameters, we derive a region where all tests are asymptotically powerless. Outside of this region, the Berk-Jones and the Higher Criticism tests of these P-values have maximal power. Inference techniques based on the minimal P-value, false-discovery rate controlling, and Fisher's combination test have sub-optimal asymptotic performance. We provide various examples for recently studied signal detection models that fall under our setting as well as several new ones. Our perturbed log-chisquared P-values formulation seamlessly generalizes these models to their two-sample variant and heteroscedastic situations. The log-chisquared approximation for P-values under the alternative hypothesis is different from Bahadur's classical log-normal approximation. The latter turns out to be unsuitable for analyzing optimal detection in rare/weak feature models.
翻译:考虑一个多位假设测试设置, 测量的特性中只有一小部分含有非核效应。 在典型条件下, 与每个特性相关的P值日志大约是一个稀疏的奇二次分布混合体, 其中之一是缩放和非中央的。 我们用奇二次混合参数来描述这些P值结合的全球测试的无症状性能: 控制超度的缩放参数、 描述影响大小的非中央参数以及控制非核特征的稀释性的参数。 具体地说, 在涉及最后两个参数的阶段空间里, 我们得出一个所有测试都处于无症状性弱分布的区域。 在这个区域之外, Berk- Jones 和这些P值的高级Critictical测试具有最大性能。 基于最小P值的缩放值、 误差率控制以及 Fishercher的组合测试具有亚均匀性功能的亚优性。 我们为最近研究的信号检测模型提供了各种示例, 这些信号模型将在我们设定的平流度中以平整的平流值为正值的平流值为新。