Tensor decomposition is a powerful tool for extracting physically meaningful latent factors from multi-dimensional nonnegative data, and has been an increasing interest in a variety of fields such as image processing, machine learning, and computer vision. In this paper, we propose a sparse nonnegative Tucker decomposition and completion method for the recovery of underlying nonnegative data under noisy observations. Here the underlying nonnegative data tensor is decomposed into a core tensor and several factor matrices with all entries being nonnegative and the factor matrices being sparse. The loss function is derived by the maximum likelihood estimation of the noisy observations, and the $\ell_0$ norm is employed to enhance the sparsity of the factor matrices. We establish the error bound of the estimator of the proposed model under generic noise scenarios, which is then specified to the observations with additive Gaussian noise, additive Laplace noise, and Poisson observations, respectively. Our theoretical results are better than those by existing tensor-based or matrix-based methods. Moreover, the minimax lower bounds are shown to be matched with the derived upper bounds up to logarithmic factors. Numerical examples on both synthetic and real-world data sets demonstrate the superiority of the proposed method for nonnegative tensor data completion.
翻译:电离分解是从多维非阴性数据中提取具有物理意义的潜在因素的有力工具,对图像处理、机器学习和计算机视觉等各个领域的兴趣日益浓厚。 在本文中,我们提出了一种稀疏的非阴性的塔克分解和完成方法,以便在噪音观测中恢复基本的非阴性数据。 基础的非阴性数据振素分解正在分解成一个核心微粒和几个要素矩阵,所有条目都是非阴性,要素矩阵正在稀释。 损失函数来自对噪音观测的最大可能性估计, 并使用$\ell_0美元标准来提高要素矩阵的宽度。 我们根据通用噪音假设, 确定拟议模型的估算符的错误界限, 并随后分别与添加加固的高音、添加的拉贝噪音和 Poisson 观测的观测结果相匹配。 我们的理论结果比现有的高压或基于要素矩阵的方法要好。 此外, 微缩缩缩缩缩缩图的界限与合成集成型数据集成型集成型集成型模型相比, 也显示了Nusblock log 。