The non-negative matrix factorization (NMF) model with an additional orthogonality constraint on one of the factor matrices, called the orthogonal NMF (ONMF), has been found a promising clustering model and can outperform the classical K-means. However, solving the ONMF model is a challenging optimization problem because the coupling of the orthogonality and non-negativity constraints introduces a mixed combinatorial aspect into the problem due to the determination of the correct status of the variables (positive or zero). Most of the existing methods directly deal with the orthogonality constraint in its original form via various optimization techniques, but are not scalable for large-scale problems. In this paper, we propose a new ONMF based clustering formulation that equivalently transforms the orthogonality constraint into a set of norm-based non-convex equality constraints. We then apply a non-convex penalty (NCP) approach to add them to the objective as penalty terms, leading to a problem that is efficiently solvable. One smooth penalty formulation and one non-smooth penalty formulation are respectively studied. We build theoretical conditions for the penalized problems to provide feasible stationary solutions to the ONMF based clustering problem, as well as proposing efficient algorithms for solving the penalized problems of the two NCP methods. Experimental results based on both synthetic and real datasets are presented to show that the proposed NCP methods are computationally time efficient, and either match or outperform the existing K-means and ONMF based methods in terms of the clustering performance.
翻译:非负矩阵因子化模型(NMF)与一个因素矩阵(称为正值或零值)的额外正值约束一道,非负矩阵因子因子因子因子因子因子因子因子因子因子因子模型(NMF)被发现是一个有希望的组合模型,能够超越经典K手段。然而,解决非负矩阵因子因子因子因子因子因子因子(正值或零值)的混合组合模型(NMF)模式是一个具有挑战性的优化问题。大部分现有方法直接通过各种优化技术处理其原形态的正值NMF(NMF)制约,但对于大规模问题来说无法伸缩。在本文件中,我们提出了一个新的基于NMFMF集群模型(ON)模式,以等量性因子因子因子因子因子因子因子因子因子因子因子因子因子因子因子因子因子因子因子因子因子因子因子因子因子因子因子因子因子因子因子因子因子因子因子因子因子因子因子因子因子因子因子因子因子因子因子因子因子因子因子因子因子因子而相互混并异而相互混并并并并并并并并分不一,造成,造成问题。