Many statistical settings call for estimating a population parameter, most typically the population mean, based on a sample of matrices. The most natural estimate of the population mean is the arithmetic mean, but there are many other matrix means that may behave differently, especially in high dimensions. Here we consider the matrix harmonic mean as an alternative to the arithmetic matrix mean. We show that in certain high-dimensional regimes, the harmonic mean yields an improvement over the arithmetic mean in estimation error as measured by the operator norm. Counter-intuitively, studying the asymptotic behavior of these two matrix means in a spiked covariance estimation problem, we find that this improvement in operator norm error does not imply better recovery of the leading eigenvector. We also show that a Rao-Blackwellized version of the harmonic mean is equivalent to a linear shrinkage estimator studied previously in the high-dimensional covariance estimation literature, while applying a similar Rao-Blackwellization to regularized sample covariance matrices yields a novel nonlinear shrinkage estimator. Simulations complement the theoretical results, illustrating the conditions under which the harmonic matrix mean yields an empirically better estimate.
翻译:许多统计设置都要求根据矩阵样本来估计人口参数,通常是人口平均值。最自然的人口平均值是算术平均值,但还有其他许多矩阵方法可能不同,特别是在高维方面。在这里,我们认为矩阵调和平均值是计算矩阵平均值的替代值。我们显示,在某些高维系统中,调和平均值比操作员标准所测量的估计误差的算算值平均值有改进作用。反直觉,研究这两个矩阵的无反应性行为意味着在急剧变化的估计问题中出现新的非线性缩小估计值。我们发现,操作员标准错误的这一改进并不意味着更好地恢复领先的导体元体。我们还表明,调和率平均值的拉奥-黑化版本相当于以前在高维常量估计文献中研究的线性缩小估计值,同时对常规化的样本共变差矩阵采用类似的拉奥-黑化模型,得出新的非线性缩略估计值。我们模拟了理论结果,说明了调和矩阵意味着得出更好的实验性估计值的条件。