While parametric multiple change point detection has been widely studied, less attention has been given to the nonparametric task of detecting multiple change points in a sequence of observations when their distribution is unknown. Most existing work on this topic is either based on penalized cost functions which can suffer from false positive detections, or on binary segmentation which can fail to detect certain configurations of change points. We introduce a new approach to change point detection which adapts the recently proposed Wild Binary Segmentation (WBS) procedure to a nonparametric setting. Our approach is based on the use of rank based test statistics which are especially powerful at detecting changes in location and/or scale. We show via simulation that the resulting nonparametric WBS procedure has favorable performance compared to existing methods, particularly when it comes to detecting changes in scale. We apply our procedure to study a problem in stylometry involving change points in an author's writing style, and provide a full implementation of our algorithm in an associated R package.
翻译:虽然对参数多变点的探测进行了广泛研究,但对于在分布不明的情况下,在一系列观测中发现多变点这一非参数性任务的关注却较少,关于这一专题的现有工作大多基于受罚的成本功能,这种功能可能受到假正探测的影响,或者基于二元分解,这种分解无法检测某些变化点的配置。我们采用了新的换点检测方法,将最近提议的野生二元分解(WBS)程序调整为非参数性设定。我们的方法是使用基于等级的测试统计数据,这种数据对检测地点和/或比例的变化特别有影响力。我们通过模拟表明,所产生的非参数性WBS程序比现有方法具有优异性性,特别是在检测规模变化时。我们采用程序来研究涉及作者写作风格变化点的音质测量问题,并在相关的R包中全面应用我们的算法。