We introduce a new methodology for two-sample testing of high-dimensional linear regression coefficients without assuming that those coefficients are individually estimable. The procedure works by first projecting the matrices of covariates and response vectors along directions that are complementary in sign in a subset of the coordinates, a process which we call 'complementary sketching'. The resulting projected covariates and responses are aggregated to form two test statistics. We show that our procedure has essentially optimal asymptotic power under Gaussian designs with a general class of design covariance matrices when the difference between the two regression coefficients is sparse and dense respectively. Simulations confirm that our methods perform well in a broad class of settings.
翻译:我们引入了一种新的方法,用于对高维线性回归系数进行二类测试,而不必假设这些系数是个人可估量的。程序是首先在坐标的一个子块上沿着补充标志的方位预测共变矢量和反应矢量矩阵,我们称之为“补充草图”的过程。因此,预测的共变量和反应将汇总成两个测试统计数据。我们表明,在两种回归系数的差别分别是稀少和密集的情况下,我们的程序在高斯设计下基本上具有最佳的无药可耐力,在设计共变矩阵中具有一般类型的设计变量。模拟证实我们的方法在广泛的环境类别中表现良好。