We develop a dimension reduction framework for data consisting of matrices of counts. Our model is based on assuming the existence of a small amount of independent normal latent variables that drive the dependency structure of the observed data, and can be seen as the exact discrete analogue for a contaminated low-rank matrix normal model. We derive estimators for the model parameters and establish their root-$n$ consistency. An extension of a recent proposal from the literature is used to estimate the latent dimension of the model. Additionally, a sparsity-accommodating variant of the model is considered. The method is shown to surpass both its vectorization-based competitors and matrix methods assuming the continuity of the data distribution in analysing simulated data and real abundance data.
翻译:我们为由计数矩阵组成的数据制定了一个维度减少框架。我们的模型基于假设存在少量独立的正常潜在变数,这些变数驱动了观察到的数据的依赖性结构,可被视为受污染的低级矩阵正常模型的离散类比。我们为模型参数得出估计值,并确立其根值-美元一致性。文献中最近一项提案的延伸用于估计模型的潜值。此外,还考虑了模型的宽度-通融变式。这种方法超过了基于传量的竞争对手和矩阵方法,假设在分析模拟数据和实际丰度数据时数据分布的连续性。