We consider the problem of jointly modeling and clustering populations of tensors by introducing a flexible high-dimensional tensor mixture model with heterogeneous covariances. The proposed mixture model exploits the intrinsic structures of tensor data, and is assumed to have means that are low-rank and internally sparse as well as heterogeneous covariances that are separable and conditionally sparse. We develop an efficient high-dimensional expectation-conditional-maximization (HECM) algorithm that breaks the challenging optimization in the M-step into several simpler conditional optimization problems, each of which is convex, admits regularization and has closed-form updating formulas. We show that the proposed HECM algorithm, with an appropriate initialization, converges geometrically to a neighborhood that is within statistical precision of the true parameter. Such a theoretical analysis is highly nontrivial due to the dual non-convexity arising from both the EM-type estimation and the non-convex objective function in the M-step. The efficacy of our proposed method is demonstrated through simulation studies and an application to an autism spectrum disorder study, where our analysis identifies important brain regions for diagnosis.
翻译:我们考虑采用灵活高维抗体混合模型,采用多种差异共变体,共同建模和组组群变色体群的问题。拟议混合模型利用了高温数据的内在结构,并假定具有低层次、内部稀少以及不同差异的可分离和有条件稀释的手段。我们开发了高效的高维预期-条件最大化算法,将M级具有挑战性的优化分成几个更简单的条件优化问题,每个问题都是 convex、接受正规化和采用封闭式的更新公式。我们表明,拟议的HEMM算法经过适当的初始化,从几何方面结合到真实参数统计精确度范围内的邻里。这种理论分析由于EM型估计和非cionx目标功能在M级中产生的双重非共性,因此非常不具有边际性。我们拟议方法的功效通过模拟研究和应用自闭谱系障碍研究得到证明,我们的分析确定了重要的大脑诊断区域。