We introduce an estimation method of covariance matrices in a high-dimensional setting, i.e., when the dimension of the matrix, , is larger than the sample size . Specifically, we propose an orthogonally equivariant estimator. The eigenvectors of such estimator are the same as those of the sample covariance matrix. The eigenvalue estimates are obtained from an adjusted profile likelihood function derived by approximating the integral of the density function of the sample covariance matrix over its eigenvectors, which is a challenging problem in its own right. Exact solutions to the approximate likelihood equations are obtained and employed to construct estimates that involve a tuning parameter. Bootstrap and cross-validation based algorithms are proposed to choose this tuning parameter under various loss functions. Finally, comparisons with two well-known orthogonally equivariant estimators of the covariance matrix are given, which are based on Monte-Carlo risk estimates for simulated data and misclassification errors in real data analyses. In addition, Monte-Carlo risk estimates are also provided to compare our estimates of eigenvalues to those of a consistent estimator of population eigenvalues.
翻译:在高维设置中,即当矩阵的尺寸大于样本大小时,我们采用共变矩阵的估计方法,即当矩阵的尺寸大于样本大小时,我们采用一种估计方法。具体地说,我们建议采用一个正方位正方位等异异异静测仪。这种估计值的源数与样本共变矩阵的值相同。这种估计值是从一个调整的剖面概率函数中得出的,该参数是样本共变矩阵的密度函数的内含性,它本身就是一个具有挑战性的问题。我们获得了对近似可能性方程式的精确解决办法,并用于构建涉及调值的估计数。建议根据各种损失函数选择这一调值。最后,根据Monte-Carlo对调异常态矩阵中两个广为人知的或正方位均匀估测值,根据对模拟数据的风险估计数和真实数据值中的误差,提供了对人口估计值的连续比较。此外,还提供了对人口估计值的蒙特-卡洛估计值与这些数值的一致的比较。