We investigate block diagonal and hierarchical nested stochastic multivariate Gaussian models by studying their sample cross-correlation matrix on high dimensions. By performing numerical simulations, we compare a filtered sample cross-correlation with the population cross-correlation matrices by using several rotationally invariant estimators (RIE) and hierarchical clustering estimators (HCE) under several loss functions. We show that at large but finite sample size, sample cross-correlation filtered by RIE estimators are often outperformed by HCE estimators for several of the loss functions. We also show that for block models and for hierarchically nested block models the best determination of the filtered sample cross-correlation is achieved by introducing two-step estimators combining state-of-the-art non-linear shrinkage models with hierarchical clustering estimators.
翻译:我们通过在高维方面研究其样本交叉关系矩阵,对块形三角和等级嵌套多变高斯星模型进行调查。通过进行数字模拟,我们将过滤的样本交叉关系与人口交叉关系矩阵进行比较,在几个损失函数下,使用若干旋转不定的测算器(RIE)和等级集群测算器(HCE)。我们显示,在大但有限的样本大小中,由RIE估计器过滤的样本交叉关系往往超过HCE测算器对若干损失函数的测算器。我们还显示,对于块形模型和按等级标定的块模型,通过引入两步测算器,将高级非线性非线性测算器与分层集测算器相结合,可以最好地确定过滤的样本交叉关系。