We introduce a new methodology 'charcoal' for estimating the location of sparse changes in high-dimensional linear regression coefficients, without assuming that those coefficients are individually sparse. The procedure works by constructing different sketches (projections) of the design matrix at each time point, where consecutive projection matrices differ in sign in exactly one column. The sequence of sketched design matrices is then compared against a single sketched response vector to form a sequence of test statistics whose behaviour shows a surprising link to the well-known CUSUM statistics of univariate changepoint analysis. Strong theoretical guarantees are derived for the estimation accuracy of the procedure, which is computationally attractive, and simulations confirm that our methods perform well in a broad class of settings.
翻译:我们采用了一种新的方法“木炭”来估计高维线性回归系数中微小变化的位置,而没有假设这些系数是个别稀疏的。程序工作方法是在每个时间点构建设计矩阵的不同草图(预测 ), 在每个时间点上, 连续的投影矩阵在一栏上有不同的标志。 然后, 草图设计矩阵的顺序与单一的草图反应矢量相比较, 形成一系列测试统计数据, 其行为显示与众所周知的CUSUM单向变化点分析统计数据有惊人的联系。 为估算程序准确性提供了强有力的理论保证,这种预测在计算上具有吸引力, 模拟证实我们的方法在广泛的环境中表现良好 。