This paper studies the multi-task high-dimensional linear regression models where the noise among different tasks is correlated, in the moderately high dimensional regime where sample size $n$ and dimension $p$ are of the same order. Our goal is to estimate the covariance matrix of the noise random vectors, or equivalently the correlation of the noise variables on any pair of two tasks. Treating the regression coefficients as a nuisance parameter, we leverage the multi-task elastic-net and multi-task lasso estimators to estimate the nuisance. By precisely understanding the bias of the squared residual matrix and by correcting this bias, we develop a novel estimator of the noise covariance that converges in Frobenius norm at the rate $n^{-1/2}$ when the covariates are Gaussian. This novel estimator is efficiently computable. Under suitable conditions, the proposed estimator of the noise covariance attains the same rate of convergence as the "oracle" estimator that knows in advance the regression coefficients of the multi-task model. The Frobenius error bounds obtained in this paper also illustrate the advantage of this new estimator compared to a method-of-moments estimator that does not attempt to estimate the nuisance. As a byproduct of our techniques, we obtain an estimate of the generalization error of the multi-task elastic-net and multi-task lasso estimators. Extensive simulation studies are carried out to illustrate the numerical performance of the proposed method.
翻译:本文研究不同任务之间的噪音相互关联的多任务高维线性回归模型, 不同任务之间的噪音在中等高度系统中是相互关联的。 在中等高度系统中, 样本大小为$n美元和维度为$p$是相同的顺序。 我们的目标是估计噪音随机矢量的共变矩阵, 或等效音变量在任何对两种任务中的相关性。 将回归系数作为扰动参数处理, 我们利用多任务弹性网和多任务lasso测量器来估计扰动。 通过精确理解正方位剩余矩阵的偏差, 并通过纠正这一偏差, 我们开发了一个创新的关于噪音共变异性模型的新的估测器。 将微缩系数的推算器比值比值比值比值比值比值, 将微缩缩微缩缩图的推算法比值比值比值比值比值比值比值值值。 微缩缩略度估测算器比值比值比值比值比值比值比值比值, 数字模型比值比值比值比值比值比值比值比值比值模型, 数值比值比值比值比值比值比值比值比值比值的计算法, 。