In this paper, we study the problem of multiple change-point detection for a univariate sequence under the epidemic setting, where the behavior of the sequence alternates between a common normal state and different epidemic states. This is a non-trivial generalization of the classical (single) epidemic change-point testing problem. To explicitly incorporate the alternating structure of the problem, we propose a novel model selection based approach for simultaneous inference on both change-points and alternating states. Using the same spirit as profile likelihood, we develop a two-stage alternating pruned dynamic programming algorithm, which conducts efficient and exact optimization of the model selection criteria and has $O(n^2)$ as the worst case computational cost. As demonstrated by extensive numerical experiments, compared to classical general-purpose multiple change-point detection procedures, the proposed method improves accuracy for both change-point estimation and model parameter estimation. We further show promising applications of the proposed algorithm to multiple testing with locally clustered signals, and demonstrate its advantages over existing methods in large scale multiple testing, in DNA copy number variation detection, and in oceanographic study.
翻译:在本文中,我们研究了在流行病环境下对单一异象序列进行多重变化点检测的问题,在这种状态下,一个共同的正常状态和不同流行病状态之间的顺序交替行为。这是对传统(单一)流行病变化点检测问题的非三重概括。为了明确纳入这一问题的交替结构,我们提出了一个基于新颖的模式选择方法,用于同时对变化点和交替状态进行推断。我们用与剖面可能性相同的精神,开发了两阶段交替的双轨动动态编程算法,对模型选择标准进行高效和精确的优化,并以O(n)2美元作为最差的计算成本。与传统的通用多位点检测程序相比,拟议的方法提高了变化点估算和模型参数估测的准确性。我们进一步展示了拟议的算法对使用当地集束信号进行多重测试的有希望的应用,并展示了其在大规模多重测试、DNA复制数变异检测和海洋学研究中对现有方法的优势。