Change point testing is a well-studied problem in statistics. Owing to the emergence of high-dimensional data with structural breaks, there has been a recent surge of interest in developing methods to accommodate high-dimensionality. In practice, when the dimension is less than the sample size but is not small, it is often unclear whether a method that is tailored to high-dimensional data or simply a classical method that is developed and justified for low-dimensional data is preferred. In addition, the methods designed for low-dimensional data may not work well in the high-dimensional environment and vice versa. This naturally brings up the question of whether there is a change point test that can work for data of low, medium, and high dimensions. In this paper, we first propose a dimension-agnostic testing procedure targeting a single change point in the mean of multivariate time series. Our new test is inspired by the recent work of arXiv:2011.05068, who formally developed the notion of ``dimension-agnostic" in several testing problems for iid data. We develop a new test statistic by adopting their sample splitting and projection ideas, and combining it with the self-normalization method for time series. Using a novel conditioning argument, we are able to show that the limiting null distribution for our test statistic is the same regardless of the dimensionality and the magnitude of cross-sectional dependence. The power analysis is also conducted to understand the large sample behavior of the proposed test. Furthermore, we present an extension to test for multiple change points in the mean and derive the limiting distributions of the new test statistic under both the null and alternatives. Through Monte Carlo simulations, we show that the finite sample results strongly corroborate the theory and suggest that the proposed tests can be used as a benchmark for many time series data.
翻译:变点测试是统计学中研究深入的问题。随着高维数据的出现并且数据中存在结构性断点,近年来出现了对应于高维数据的方法的大量研究工作。实际应用中,当维数小于样本大小但不小的时候,通常很难确定哪种方法更适合,是专门针对高维数据设计的方法还是适用于低维数据的经典方法。此外,针对低维数据设计的方法在高维环境中可能效果不佳,反之亦然。这自然引出了一个问题,是否存在一种变点检测方法,适用于低、中、和高维数据。本文首先提出了一个无关维度的测试过程,针对多元时间序列均值中的单个变点。我们的新测试受到arXiv:2011.05068最近关于iid数据中几个测试问题中“无关维度”概念的启示。我们采用他们的样本分割和投影思想,结合时间序列自标准化方法,开发了一个新的检验统计量。我们通过采用新型的条件论证方法,证明了我们的检验统计量的极限零分布是相同的,无论维数和横向相关性的大小如何。本研究还进行了功效分析,以了解所提出的测试的大样本行为。此外,我们还提出了一种扩展方法,用于测试均值中的多个变点,并导出了在零假设和备择假设下新测试统计量的极限分布。通过蒙特卡洛模拟,我们证明了有限样本的结果强烈佐证了理论,并表明所提出的测试方法可以用作许多时间序列数据的基准。