We consider the problem of change point detection for high-dimensional distributions in a location family when the dimension can be much larger than the sample size. In change point analysis, the widely used cumulative sum (CUSUM) statistics are sensitive to outliers and heavy-tailed distributions. In this paper, we propose a robust, tuning-free (i.e., fully data-dependent), and easy-to-implement change point test that enjoys strong theoretical guarantees. To achieve the robust purpose in a nonparametric setting, we formulate the change point detection in the multivariate $U$-statistics framework with anti-symmetric and nonlinear kernels. Specifically, the within-sample noise is canceled out by anti-symmetry of the kernel, while the signal distortion under certain nonlinear kernels can be controlled such that the between-sample change point signal is magnitude preserving. A (half) jackknife multiplier bootstrap (JMB) tailored to the change point detection setting is proposed to calibrate the distribution of our $\ell^{\infty}$-norm aggregated test statistic. Subject to mild moment conditions on kernels, we derive the uniform rates of convergence for the JMB to approximate the sampling distribution of the test statistic, and analyze its size and power properties. Extensions to multiple change point testing and estimation are discussed with illustration from numerical studies.
翻译:当一个位置家庭高维分布的尺寸比样本大小大得多时,我们考虑在位置家庭高维分布的变化点检测问题,因为其尺寸比样本大小大得多。在变化点分析中,广泛使用的累积和累积统计(CUSUUUM)统计数据对外源和重尾分布十分敏感。在本文中,我们建议采用强力、无调(即完全数据依赖)和易于执行的改变点检测,这种测试具有很强的理论保证。为了在非参数设置中实现稳健的目的,我们在多变量(美元-统计框架)中以反对称和非线内内内内内核(CUUUUMUM)统计数据对外核外核和重尾的分布分布十分敏感。具体地,由于内核内核的反对称性,因此可以控制某些非线内核内核的信号扭曲,使变位点之间的信号在数量上得到强大的理论保证。为了在非参数设置变化点检测时,我们用美元-infty 美元-infty 统计值框架框架的分布范围, 和数值缩缩缩缩缩缩缩缩缩分析结果测试J。