Detecting change points sequentially in a streaming setting, especially when both the mean and the variance of the signal can change, is often a challenging task. A key difficulty in this context often involves setting an appropriate detection threshold, which for many standard change statistics may need to be tuned depending on the pre-change and post-change distributions. This presents a challenge in a sequential change detection setting when a signal switches between multiple distributions. For example, consider a signal where change points are indicated by increases/decreases in the mean and variance of the signal. In this context, we would like to be able to compare our change statistic to a fixed threshold that will be symmetric to either increases or decreases in the mean and variance. Unfortunately, change point detection schemes that use the log-likelihood ratio, such as CUSUM and GLR, are quick to react to changes but are not symmetric when both the mean and the variance of the signal change. This makes it difficult to set a single threshold to detect multiple change points sequentially in a streaming setting. We propose a modified version of CUSUM that we call Data-Adaptive Symmetric CUSUM (DAS-CUSUM). The DAS-CUSUM change point detection procedure is symmetric for changes between distributions, making it suitable to set a single threshold to detect multiple change points sequentially in a streaming setting. We provide results that relate to the expected detection delay and average run length for our proposed procedure. Extensive simulations are used to validate these results. Experiments on real-world data further show the utility of using DAS-CUSUM over both CUSUM and GLR.
翻译:在流流环境中,特别是当信号的平均值和值差都可能发生变化时,连续检测变化点往往是一项具有挑战性的任务。这方面的一个关键困难往往涉及设定一个适当的检测阈值,对于许多标准的变更统计可能需要根据变化前和变化后分布情况加以调整。这是在多个分布之间的信号开关时,在顺序检测设置中遇到的挑战。例如,考虑一个信号通过信号平均值和值差的增减来显示变化点的信号。在这方面,我们希望能够将我们的变化统计与固定阈值进行比较,这个固定阈值将具有对称性,要么可以增加,要么可以减少平均值和差异。不幸的是,对于许多标准的变更统计值统计可能需要根据变化前和变化后的分布分布比对调。如果在多个分布的平均值和信号变异时,则对顺序检测显示一个单一的阈值。我们建议修改的 CUUM-A的版本,我们建议将数据-Aral-Aral的测算结果调成一个稳定的运行点。我们用这些测算性数据-AUR的测算的运行点将显示一个特定的测算程序。