In many real-world problems, complex dependencies are present both among samples and among features. The Kronecker sum or the Cartesian product of two graphs, each modeling dependencies across features and across samples, has been used as an inverse covariance matrix for a matrix-variate Gaussian distribution, as an alternative to a Kronecker-product inverse covariance matrix, due to its more intuitive sparse structure. However, the existing methods for sparse Kronecker-sum inverse covariance estimation are limited in that they do not scale to more than a few hundred features and samples and that the unidentifiable parameters pose challenges in estimation. In this paper, we introduce EiGLasso, a highly scalable method for sparse Kronecker-sum inverse covariance estimation, based on Newton's method combined with eigendecomposition of the two graphs for exploiting the structure of Kronecker sum. EiGLasso further reduces computation time by approximating the Hessian based on the eigendecomposition of the sample and feature graphs. EiGLasso achieves quadratic convergence with the exact Hessian and linear convergence with the approximate Hessian. We describe a simple new approach to estimating the unidentifiable parameters that generalizes the existing methods. On simulated and real-world data, we demonstrate that EiGLasso achieves two to three orders-of-magnitude speed-up compared to the existing methods.
翻译:在许多现实世界问题中,样品和特征之间都存在复杂的依赖性。Kronecker和卡泰什产品或两个图形的卡尔内克体积或卡尔泰斯产物,每个特征和样本之间都建模依赖性,作为矩阵变异性高斯分布的反共变矩阵矩阵,作为Kronecker产品反常变异矩阵的替代,因为其结构更直观,稀释性稀释性稀释性稀释性。然而,现有的克罗内克和反常变性估计方法有限,因为它们的规模不大于几百个特征和样本,而且无法识别的参数对估算构成挑战。在本文件中,我们采用了EiGLasso,这是一种高度可伸缩的方法,用于稀释基列高正变和反常变性差异矩阵分布,其依据是牛顿的方法,加上两种图表的变异性组合,以利用克伦内尔克和反常态结构进行计算。EiGasso进一步减少计算时间,办法是根据对赫西亚的平面变速度参数进行对比,其基地基地组地组地组地组地组的样本和地基地组地组地组地组正地组地组地组正地组正地组正地组地组地组地组地组地组地组地组正,用新的地组地组地组地组地组地组地组地组地组地组地组地图,用新的地组地组地组地组地组地组地组地组地组地组地组地组地组地算。