Without imposing prior distributional knowledge underlying multivariate time series of interest, we propose a nonparametric change-point detection approach to estimate the number of change points and their locations along the temporal axis. We develop a structural subsampling procedure such that the observations are encoded into multiple sequences of Bernoulli variables. A maximum likelihood approach in conjunction with a newly developed searching algorithm is implemented to detect change points on each Bernoulli process separately. Then, aggregation statistics are proposed to collectively synthesize change-point results from all individual univariate time series into consistent and stable location estimations. We also study a weighting strategy to measure the degree of relevance for different subsampled groups. Simulation studies are conducted and shown that the proposed change-point methodology for multivariate time series has favorable performance comparing with currently popular nonparametric methods under various settings with different degrees of complexity. Real data analyses are finally performed on categorical, ordinal, and continuous time series taken from fields of genetics, climate, and finance.
翻译:我们建议采用非参数的变化点检测方法,估计时间轴沿线的变化点及其位置。我们开发了一个结构子抽样程序,将观测编码成Bernoulli变量的多个序列。与新开发的搜索算法一起实施最大可能性方法,以分别探测伯尔尼利进程每个进程的变化点。然后,提出汇总统计数据,将所有单项单项时间序列的变化点结果合并为一致和稳定的位置估计。我们还研究一个加权战略,以衡量不同子抽样组的相关性程度。进行模拟研究,并表明,拟议中的多变量时间序列变化点方法与不同复杂程度的当前流行的非参数方法相比,具有有利的性能。最终在绝对性、正统性以及从遗传学、气候和金融领域连续进行的时间序列上进行了真正的数据分析。