Detecting when the underlying distribution changes for the observed time series is a fundamental problem arising in a broad spectrum of applications. In this paper, we study multiple change-point localization in the high-dimensional regression setting, which is particularly challenging as no direct observations of the parameter of interest is available. Specifically, we assume we observe $\{ x_t, y_t\}_{t=1}^n$ where $ \{ x_t\}_{t=1}^n $ are $p$-dimensional covariates, $\{y_t\}_{t=1}^n$ are the univariate responses satisfying $\mathbb{E}(y_t) = x_t^\top \beta_t^* \text{ for } 1\le t \le n $ and $\{\beta_t^*\}_{t=1}^n $ are the unobserved regression coefficients that change over time in a piecewise constant manner. We propose a novel projection-based algorithm, Variance Projected Wild Binary Segmentation~(VPWBS), which transforms the original (difficult) problem of change-point detection in $p$-dimensional regression to a simpler problem of change-point detection in mean of a one-dimensional time series. VPWBS is shown to achieve sharp localization rate $O_p(1/n)$ up to a log factor, a significant improvement from the best rate $O_p(1/\sqrt{n})$ known in the existing literature for multiple change-point localization in high-dimensional regression. Extensive numerical experiments are conducted to demonstrate the robust and favorable performance of VPWBS over two state-of-the-art algorithms, especially when the size of change in the regression coefficients $\{\beta_t^*\}_{t=1}^n $ is small.
翻译:当观测到的时间序列的基本分布变化是一个在广泛应用范围内产生的根本性问题时 。 在本文中, 我们研究高维回归设置中的多变点本地化, 因为没有直接观测利息参数, 这一点特别具有挑战性 。 具体地说, 我们假设我们观察$x_ t, y_ t ⁇ t=1 ⁇ n$, 其中美元 x_ t ⁇ t=1 ⁇ n$ 是 美元 x_ t ⁇ t=1 美元 维基数的基数, $_y_ t ⁇ t=1 ⁇ n rational=1 美元是 univarite religize $malb{( y_ p} (y_ t) y_t) = x_ t ⁇ t\\\\ t{t{ {tun} 。 我们假设我们观察到$ xxxx_ xx_ t_ t ⁇ t_ t ⁇ t=1 ⁇ n $ 美元, 美元的基数值的基数值变化是一个未知的基数 问题。 我们提议一种基于预测的算算算算算算算算算的算的算算的算算的法, 度变了原( 美元) talyaltime_ rolate_ legal- laxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx