Learning multi-view data is an emerging problem in machine learning research, and nonnegative matrix factorization (NMF) is a popular dimensionality-reduction method for integrating information from multiple views. These views often provide not only consensus but also diverse information. However, most multi-view NMF algorithms assign equal weight to each view or tune the weight via line search empirically, which can be computationally expensive or infeasible without any prior knowledge of the views. In this paper, we propose a weighted multi-view NMF (WM-NMF) algorithm. In particular, we aim to address the critical technical gap, which is to learn both view-specific and observation-specific weights to quantify each view's information content. The introduced weighting scheme can alleviate unnecessary views' adverse effects and enlarge the positive effects of the important views by assigning smaller and larger weights, respectively. In addition, we provide theoretical investigations about the convergence, perturbation analysis, and generalization error of the WM-NMF algorithm. Experimental results confirm the effectiveness and advantages of the proposed algorithm in terms of achieving better clustering performance and dealing with the corrupted data compared to the existing algorithms.
翻译:在机器学习研究中,学习的多视角数据是一个正在出现的问题,非负矩阵因素化(NMF)是综合多种观点信息的一种流行的减少维度方法。这些观点往往不仅提供共识,而且提供多种信息。然而,大多数多视角NMF算法对每种观点都赋予同等的份量,或者通过线上搜索调节重量,这在不事先了解这些观点的情况下,计算成本可能很高或者不可行。在本文中,我们建议采用加权的多视图NMF算法(WM-NMF),特别是我们力求解决关键的技术差距,即学习特定观点和特定观察的权重,以量化每种观点的信息内容。引入的加权法可以分别对较小或更大的权重分别减轻不必要观点的不利影响,扩大重要观点的积极影响。此外,我们对WM-NMF算法的趋同、扰动分析以及一般错误进行理论性调查。实验结果证实了拟议的算法在实现更好的组合性表现和处理与现有算法相比腐败数据方面的有效性和优势。