It is well known that most of the existing theoretical results in statistics are based on the assumption that the sample is generated with replacement from an infinite population. However, in practice, available samples are almost always collected without replacement. If the population is a finite set of real numbers, whether we can still safely use the results from samples drawn without replacement becomes an important problem. In this paper, we focus on the eigenvalues of high-dimensional sample covariance matrices generated without replacement from finite populations. Specifically, we derive the Tracy-Widom laws for their largest eigenvalues and apply these results to parallel analysis. We provide new insight into the permutation methods proposed by Buja and Eyuboglu in [Multivar Behav Res. 27(4) (1992) 509--540]. Simulation and real data studies are conducted to demonstrate our results.
翻译:众所周知,大多数统计的现有理论结果都基于一种假设,即抽样是由无穷人口取而代之,但实际上,收集的样本几乎总是不替换的;如果人口是一组有限的实际数字,那么我们能否仍然安全地使用从不替换的抽样中得出的结果就成为一个重要问题;在本文中,我们侧重于在不由有限人口取而代之的情况下产生的高维采样共变矩阵的元值。具体地说,我们为最大的无穷人口取出Tracy-Widom法律,并将这些结果用于平行分析。我们对Buja和Eyuboglu in[Multivar Behav Res. 27(4)(1992) 509-540]提议的变异方法提供了新的了解。我们进行了模拟和真实数据研究,以展示我们的结果。