Modern genomic studies are increasingly focused on discovering more and more interesting genes associated with a health response. Traditional shrinkage priors are primarily designed to detect a handful of signals from tens and thousands of predictors. Under diverse sparsity regimes, the nature of signal detection is associated with a tail behaviour of a prior. A desirable tail behaviour is called tail-adaptive shrinkage property where tail-heaviness of a prior gets adaptively larger (or smaller) as a sparsity level increases (or decreases) to accommodate more (or less) signals. We propose a global-local-tail (GLT) Gaussian mixture distribution to ensure this property and provide accurate inference under diverse sparsity regimes. Incorporating a peaks-over-threshold method in extreme value theory, we develop an automated tail learning algorithm for the GLT prior. We compare the performance of the GLT prior to the Horseshoe in two gene expression datasets and numerical examples. Results suggest that varying tail rule is advantageous over fixed tail rule under diverse sparsity domains.
翻译:现代基因组研究日益侧重于发现与健康反应有关的更多、更有趣的基因。传统的缩水前科主要设计用来探测数万和数千个预测器的几组信号。在不同的宽度制度下,信号探测的性质与先发体的尾部行为相关。可取的尾部行为称为尾部适应性缩水特性,先发体的尾部-重力作为宽度增加(或减少)以容纳更多(或更少)信号。我们提议全球-当地尾部(GLT)高斯的混合物分布,以确保这一属性,并在不同的保温制度下提供准确的推断。在极端价值理论中采用峰值超过临界值的方法,我们为GLT开发了一种自动尾部学习算法。我们用两个基因表达数据集和数字例子比较了马蹄之前GLT的性能。结果显示,不同的尾部规则优于不同储量范围内的固定尾部规则。