Non-negative matrix factorization (NMF) is a fundamental matrix decomposition technique that is used primarily for dimensionality reduction and is increasing in popularity in the biological domain. Although finding a unique NMF is generally not possible, there are various iterative algorithms for NMF optimization that converge to locally optimal solutions. Such techniques can also serve as a starting point for deep learning methods that unroll the algorithmic iterations into layers of a deep network. Here we develop unfolded deep networks for NMF and several regularized variants in both a supervised and an unsupervised setting. We apply our method to various mutation data sets to reconstruct their underlying mutational signatures and their exposures. We demonstrate the increased accuracy of our approach over standard formulations in analyzing simulated and real mutation data.
翻译:非负矩阵因子化(NMF)是一种基本矩阵分解技术,主要用于减少维度,在生物领域越来越受欢迎。虽然一般不可能找到独特的NMF,但有各种用于优化NMF的迭代算法,这些算法可以集中到当地最佳解决办法。这些技术也可以作为深层学习方法的起点,将算法的迭代法分解成深层网络。在这里,我们开发出国家MF的深层网络和几个正规化变异体,既在受监督的环境中,又在不受监督的环境中。我们用我们的方法对各种变异数据集进行了应用,以重建其基本突变特征及其暴露。我们在分析模拟和真实变异数据时,对标准配方的方法的准确性提高了。我们展示了在分析模拟和真实变异性数据时对标准配方的准确性。