We propose a change-point detection method for large scale multiple testing problems with data having clustered signals. Unlike the classic change-point setup, the signals can vary in size within a cluster. The clustering structure on the signals enables us to effectively delineate the boundaries between signal and non-signal segments. New test statistics are proposed for observations from one and/or multiple realizations. Their asymptotic distributions are derived. We also study the associated variance estimation problem. We allow the variances to be heteroscedastic in the multiple realization case, which substantially expands the applicability of the proposed method. Simulation studies demonstrate that the proposed approach has a favorable performance. Our procedure is applied to {an array based Comparative Genomic Hybridization (aCGH)} dataset.
翻译:我们建议了一种变化点检测方法,用于大规模多重测试问题,而数据是集成的信号。与传统的变化点设置不同,信号在组内的规模可能不同。信号上的群集结构使我们能够有效地划定信号与非信号部分之间的界限。提出了新的测试统计数据,以便从一个和(或)多个实现中进行观测。它们无症状分布得到推导。我们还研究相关的差异估计问题。我们允许在多个实现中差异是混杂的,这大大扩大了拟议方法的适用性。模拟研究表明,拟议方法具有有利的性能。我们的程序适用于{基于阵列的比较基因交集(aCGH)}数据集。