The issue of combining individual $p$-values to aggregate multiple small effects is prevalent in many scientific investigations and is a long-standing statistical topic. Many classical methods are designed for combining independent and frequent signals in a traditional meta-analysis sense using the sum of transformed $p$-values with the transformation of light-tailed distributions, in which Fisher's method and Stouffer's method are the most well-known. Since the early 2000, advances in big data promoted methods to aggregate independent, sparse and weak signals, such as the renowned higher criticism and Berk-Jones tests. Recently, Liu and Xie(2020) and Wilson(2019) independently proposed Cauchy and harmonic mean combination tests to robustly combine $p$-values under "arbitrary" dependency structure, where a notable application is to combine $p$-values from a set of often correlated SNPs in genome-wide association studies. The proposed tests are the transformation of heavy-tailed distributions for improved power with the sparse signal. It calls for a natural question to investigate heavy-tailed distribution transformation, to understand the connection among existing methods, and to explore the conditions for a method to possess robustness to dependency. In this paper, we investigate the regularly varying distribution, which is a rich family of heavy-tailed distribution and includes Pareto distribution as a special case. We show that only an equivalent class of Cauchy and harmonic mean tests have sufficient robustness to dependency in a practical sense. We also show an issue caused by large negative penalty in the Cauchy method and propose a simple, yet practical modification. Finally, we present simulations and apply to a neuroticism GWAS application to verify the discovered theoretical insights and provide practical guidance.
翻译:将个人美元价值与多重小额效应相结合的问题在许多科学调查中十分普遍,而且是一个长期的统计专题。许多古典方法设计在传统的元分析意义上结合独立和经常信号,使用转化后的美元价值和光速分配结构的转变,其中最广为人知的是Fisher的方法和Stouffer的方法。自2000年初以来,大数据推广方法的进展使得独立、稀疏和薄弱的信号(如著名的实际批评和Berk-Jones测试)更加一致。最近,刘和谢(2020202020年)和Wilson(2019年)独立地提议在“任意”依赖性结构下将独立和频繁的信号结合成一个传统的元值,在“任意”依赖性结构下将美元价值和光速分配结构的转变结合起来。自2000年初以来,大数据推广方法的转变是为了提高实力的重尾量分布。它要求有一个自然问题来调查重尾部的理论转换,在现有的方法中提出大规模和口服的混合的混合组合测试,我们用一种稳健的货币分配方式来调查一个特殊的货币分配情况。我们从一个特殊的货币分配的方法来显示一种稳重的货币分配。