One common approach to detecting change-points is minimizing a cost function over possible numbers and locations of change-points. The framework includes several well-established procedures, such as the penalized likelihood and minimum description length. Such an approach requires finding the cost value repeatedly over different segments of the data set, which can be time-consuming when (i) the data sequence is long and (ii) obtaining the cost value involves solving a non-trivial optimization problem. This paper introduces a new sequential method (SE) that can be coupled with gradient descent (SeGD) and quasi-Newton's method (SeN) to find the cost value effectively. The core idea is to update the cost value using the information from previous steps without re-optimizing the objective function. The new method is applied to change-point detection in generalized linear models and penalized regression. Numerical studies show that the new approach can be orders of magnitude faster than the Pruned Exact Linear Time (PELT) method without sacrificing estimation accuracy.
翻译:一种常见的发现变化点的方法是尽可能减少变化点数量和地点的成本功能。框架包括若干既定程序,如受处罚的可能性和最低描述长度。这种方法要求反复查找数据集不同部分的成本价值,如果(一) 数据序列很长,而且(二) 获得成本价值涉及解决非三重优化问题,则可能耗费时间。本文介绍了一种新的顺序方法,可以与梯度下降和准牛顿方法(SeN)相结合,有效找到成本价值。核心思想是利用以往步骤的信息更新成本价值,而不重新优化目标功能。新的方法用于在一般线性模型中进行变更点检测和惩罚回归。数字研究显示,新的方法在不牺牲估计准确性的情况下,其规模可以比Prunened Exact Linear时间(PELT)方法更快。