This paper presents DRE-CUSUM, an unsupervised density-ratio estimation (DRE) based approach to determine statistical changes in time-series data when no knowledge of the pre-and post-change distributions are available. The core idea behind the proposed approach is to split the time-series at an arbitrary point and estimate the ratio of densities of distribution (using a parametric model such as a neural network) before and after the split point. The DRE-CUSUM change detection statistic is then derived from the cumulative sum (CUSUM) of the logarithm of the estimated density ratio. We present a theoretical justification as well as accuracy guarantees which show that the proposed statistic can reliably detect statistical changes, irrespective of the split point. While there have been prior works on using density ratio based methods for change detection, to the best of our knowledge, this is the first unsupervised change detection approach with a theoretical justification and accuracy guarantees. The simplicity of the proposed framework makes it readily applicable in various practical settings (including high-dimensional time-series data); we also discuss generalizations for online change detection. We experimentally show the superiority of DRE-CUSUM using both synthetic and real-world datasets over existing state-of-the-art unsupervised algorithms (such as Bayesian online change detection, its variants as well as several other heuristic methods).
翻译:本文介绍了DRE-CUSUM, 这是一种在对估计密度比率的对数累积总和(CUSUM)中得出的基于密度估计(DRE-CUSUM)法,在对变化前和变化后分布分布不知情的情况下,确定时间序列数据的统计变化,而DRE-CUSUUM(DRE-CUSUUM)法是未经监督的密度估计(DRE-CUSUUM)法。我们提出了一个理论理由和准确保证,表明拟议的统计可以可靠地检测统计变化,而不论差异点如何。虽然我们以前曾努力使用基于密度的方法来检测变化,但根据我们的最佳知识,这是第一个采用理论理由和准确性保证的未经监督的变化检测方法。拟议的框架的简单性使得它很容易适用于各种实际环境(包括高维度时间序列数据);我们还讨论了用于在线变化检测的概括性以及准确性保证。我们实验性地展示了以真实的合成-C-C-SVLA方法取代了目前的一些新的DRE-C-VI-S-S-SVIL-S-S-S-SVID-SUD-SD-S-SD-S-S-S-S-S-S-SVLOVLVIOL-S-S-S-S-S