面向复杂数据的稀疏流形学习方法研究

项目名称： 面向复杂数据的稀疏流形学习方法研究

项目编号： No.61272333

项目类型： 面上项目

立项/批准年度： 2013

项目学科： 自动化技术、计算机技术

项目作者： 雷迎科

作者单位： 中国人民解放军电子工程学院

项目金额： 79万元

中文摘要： 本项目主要针对大规模、高维数、非线性、噪声污染等复杂性数据，系统研究基于稀疏表示的流形学习方法，及其在蛋白质相互作用数据中的应用。首先，设计一种基于广义相关的非负稀疏表示近邻图构建模型，有效解决基于K-近邻或∈-球近邻准则构图的流形学习方法对于数据噪声敏感和邻域尺度参数选择难的问题。在此基础上，提出一种基于最小相关熵的数据流形本征维数估计方法。接着，设计一种基于局部与全局联合保持嵌入的稀疏流形学习方法，并针对大规模复杂数据，构建基于最小子集覆盖和谱回归混合策略的高效求解方案。最后，提出一种鲁棒的基于大规模稀疏流形嵌入的蛋白质相互作用数据去噪方法，为检测大规模蛋白质相互作用网络中的假阳性与假阴性噪声问题提供一条新的解决途径。本项目的开展既能促进机器学习的基础理论研究，又能推动其在生物信息学领域的应用。

中文关键词： 流形学习；稀疏表示；近邻图构建；蛋白质相互作用；

英文摘要： This project systematically studies sparse representation based manifold learning methods for analyzing complex data including large-scale, high-dimensional, nonlinear, and noisy data as well as their applications to the protein-protein interaction data. Firstly, an efficient non-negative sparse neighborhood graph model based on generalized correlation is devised to overcome the drawbacks of KNN or ∈-ball graph based classical manifold learning methods, such as difficulties in tuning graph neighborhood size and sensitivity to noise. According to this graph, a minimum-relative-entropy based intrinsic dimensionality estimation method for data manifold is developed. Secondly, we propose a novel sparse manifold learning method which seeks to find a low-dimensional manifold embedded in the ambient space by preserving locally and globally geometric structure, meanwhile adopting minimum set cover and spectral regression techniques to make the proposed method suitable for large-scale complex data. Deriving from the proposed method, we further develop a robust large-scale sparse manifold embedding method to assess the reliability of protein-protein interactions and predict new ones which can be considered as one novel promising solution for detecting both false positive and false negative interactions in protein inter

英文关键词： manifold learning；sparse representation；neighborhood graph construction；protein-protein interactions；

成为VIP会员查看完整内容

相关内容

流形学习

关注 0

流形学习，全称流形学习方法(Manifold Learning)，自2000年在著名的科学杂志《Science》被首次提出以来，已成为信息科学领域的研究热点。在理论和应用上，流形学习方法都具有重要的研究意义。假设数据是均匀采样于一个高维欧氏空间中的低维流形，流形学习就是从高维采样数据中恢复低维流形结构，即找到高维空间中的低维流形，并求出相应的嵌入映射，以实现维数约简或者数据可视化。它是从观测到的现象中去寻找事物的本质，找到产生数据的内在规律。

基于表格数据的深度学习方法

专知会员服务

39+阅读 · 2021年10月19日

面向目标检测的对抗样本综述

专知会员服务

49+阅读 · 2021年9月9日

单幅图像超分辨率重建技术研究进展

专知会员服务

29+阅读 · 2021年9月6日

基于深度学习的小目标检测方法综述

专知会员服务

125+阅读 · 2021年4月29日