Given a nonnegative matrix factorization, $R$, and a factorization rank, $r$, Exact nonnegative matrix factorization (Exact NMF) decomposes $R$ as the product of two nonnegative matrices, $C$ and $S$ with $r$ columns, such as $R = CS^\top$. A central research topic in the literature is the conditions under which such a decomposition is unique/identifiable, up to trivial ambiguities. In this paper, we focus on partial identifiability, that is, the uniqueness of a subset of columns of $C$ and $S$. We start our investigations with the data-based uniqueness (DBU) theorem from the chemometrics literature. The DBU theorem analyzes all feasible solutions of Exact NMF, and relies on sparsity conditions on $C$ and $S$. We provide a mathematically rigorous theorem of a recently published restricted version of the DBU theorem, relying only on simple sparsity and algebraic conditions: it applies to a particular solution of Exact NMF (as opposed to all feasible solutions) and allows us to guarantee the partial uniqueness of a single column of $C$ or $S$. Second, based on a geometric interpretation of the restricted DBU theorem, we obtain a new partial identifiability result. We prove it is stronger than the restricted DBU theorem, given that a proper preprocessing on the Exact NMF is used. This geometric interpretation also leads us to another partial identifiability result in the case $r=3$. Third, we show how partial identifiability results can be used sequentially to guarantee the identifiability of more columns of $C$ and $S$. We illustrate these results on several examples, including one from the chemometrics literature.
翻译:鉴于一个非负矩阵因子化,美元是一个核心研究课题,而一个因子化等级是美元,美元,美元,非负矩阵因子化(Exact NMF)作为两个非负矩阵(DBU)的产物,美元和美元(美元)的产物,用美元(美元)的柱子,如美元(美元),美元(美元)和美元(美元)。文献中的一个核心研究课题是这种分解的独特/可识别条件,以至微不足道的含糊不清。在本文中,我们侧重于部分可识别性,即美元和美元(美元)的一组列的独特性。我们开始调查时,以基于数据的独特性(DBU)的标本(美元)为美元(美元), 美元(美元)的标本(美元)为美元(美元)的精确性(美元(美元),我们从最近出版的限解的版本中提供了一个数学前部分精度的词框,仅依靠简单易变和高额(美元)的解释性(美元), 也以基于某种亚值(美元)的解算法性(美元)的解算出一个特殊的解算出一个特殊的解算。