高维数据的几何结构分析 - 专知基金

会员服务 ·

1

稀疏编码 · 低秩表示 · 流形学习 · 一阶优化 ·

2012 年 12 月 31 日

高维数据的几何结构分析

国家自然科学基金

国家自然科学基金委员会

项目名称： 高维数据的几何结构分析

项目编号： No.61272341

项目类型： 面上项目

立项/批准年度： 2013

项目学科： 自动化技术、计算机技术

项目作者： 林宙辰

作者单位： 北京大学

项目金额： 81万元

中文摘要： 当今是高维和海量数据的时代，如何快速有效地处理高维数据是一个巨大挑战。高维数据的分布非常复杂，几何结构分析是分析高维数据的重要方法，因为数据的几何结构蕴涵了数据的聚类和分类信息。本项目针对现有方法的一些不足之处，利用稀疏表示、半黎曼几何、核方法等数学工具，研究鲁棒的线性或非线性多子流形分解的数学模型，并基于半黎曼几何推广流形学习的理论与方法，以解决感知距离（流形上距离）小于欧氏距离的问题。为刻画数据分布的稀疏性，本项目进一步研究保持范数可分解性和促结构稀疏性的运算，以及高阶稀疏性的可计算的度量。这是目前稀疏表示理论的关键问题。本项目还研究相应的快速算法，尤其是低复杂度的随机算法，及其GPU实现，以解决处理高维数据时计算上的困难。

中文关键词： 稀疏表示；低秩表示；子空间聚类；流形学习；一阶优化

英文摘要： Nowadays we are facing with high dimensional and huge amount of data. How to process high dimensional data is a big challenge. The distribution of high dimensional data is very complicated. Geometric structural analysis is an important method to analyze high dimensional data, because the geometric structure of data implies the clustering and classification information of data. Aiming at addressing some drawbacks of existing methods, this project utilizes several mathematical tools, e.g., sparse representation, semi-Riemannian geometry, and kernel method, to investigate the mathematical models that can decompose linear or nonlinear multi-submanifolds robustly, and generalize the manifold learning theories and methods based on semi-Riemannian geometry, in order to address the issue of perceptual distance (manifold distance) being smaller than Euclidean distance. To characterize the sparsity in data distribution, this project further explores the operations that can preserve the decomposability and structural sparsity inducibility of norms, as well as the computable high-order sparsity measure, which are the key problems of the current sparse representation theories. Finally, this project studies the corresponding fast algorithms, especially the low complexity randomized algorithms, and their implementations on GPU

英文关键词： sparse representation；low-rank representation；subspace clustering；manifold learning；first order optimization

成为VIP会员查看完整内容

3

相关内容

稀疏编码

这种方法被称为Sparse Coding。通俗的说，就是将一个信号表示为一组基的线性组合，而且要求只需要较少的几个基就可以将信号表示出来

【硬核书】用于机器学习和数据挖掘的数学分析，968页pdf

专知会员服务

187+阅读 · 2021年9月3日

图像去噪方法概述

专知会员服务

43+阅读 · 2021年8月30日

算法分析导论, 593页pdf

算法分析导论, 593页pdf

专知会员服务

151+阅读 · 2021年8月30日

【经典书】半监督学习，524页pdf

【经典书】半监督学习，524页pdf

专知会员服务

139+阅读 · 2021年8月20日

【开放书】《矩阵流形优化算法》，241页pdf

【开放书】《矩阵流形优化算法》，241页pdf

专知会员服务

96+阅读 · 2021年7月3日

【ICML2021】数据表示的几何评估

专知会员服务

38+阅读 · 2021年6月3日

【博士论文】基于深度学习的图像处理算法研究

专知会员服务

81+阅读 · 2020年12月6日

复杂网络的双曲空间表征学习方法

专知会员服务

47+阅读 · 2020年11月13日

大规模时间序列分析框架的研究与实现，计算机学报

大规模时间序列分析框架的研究与实现，计算机学报

专知会员服务

59+阅读 · 2020年7月13日

自动结构变分推理，Automatic structured variational inference

自动结构变分推理，Automatic structured variational inference

专知会员服务

41+阅读 · 2020年2月10日

基于机器学习的自动化网络流量分析

基于机器学习的自动化网络流量分析

CCF计算机安全专委会

5+阅读 · 2022年4月8日

神经网络，凉了?

神经网络，凉了?

CVer

2+阅读 · 2022年3月16日

【博士论文】分形计算系统

【博士论文】分形计算系统

专知

3+阅读 · 2021年12月9日

【伯克利马毅老师等重磅新书】低维模型进行高维数据分析:原理、计算和应用，710页pdf

【伯克利马毅老师等重磅新书】低维模型进行高维数据分析:原理、计算和应用，710页pdf

专知

45+阅读 · 2020年12月9日

常见的距离算法和相似度计算方法

常见的距离算法和相似度计算方法

极市平台

18+阅读 · 2020年7月31日

博客 | 度量学习总结(二) | 如何使用度量学习处理高维数据？

博客 | 度量学习总结(二) | 如何使用度量学习处理高维数据？

AI研习社

20+阅读 · 2019年3月26日

深度丨顾险峰：深度学习的几何观点——流形分布定律

深度丨顾险峰：深度学习的几何观点——流形分布定律

德先生

17+阅读 · 2018年6月11日

谷歌实习生提出tSNE在大型高维数据集上实时可视化的方法（附代码）

谷歌实习生提出tSNE在大型高维数据集上实时可视化的方法（附代码）

论智

13+阅读 · 2018年6月8日

【干货】深入理解变分自编码器

【干货】深入理解变分自编码器

专知

21+阅读 · 2018年3月22日

手把手教你用LDA特征选择

手把手教你用LDA特征选择

AI研习社

12+阅读 · 2017年8月21日

超高维数据中若干检验问题的研究

国家自然科学基金

0+阅读 · 2015年12月31日

大数据中的广义稀疏几何结构学习方法研究

国家自然科学基金

2+阅读 · 2013年12月31日

鲁棒几何结构描述及图像识别

国家自然科学基金

1+阅读 · 2012年12月31日

矢量数学形态学理论及其在高维数据处理中的应用

国家自然科学基金

0+阅读 · 2012年12月31日

基于几何计算的可视媒体数据特征提取方法研究

国家自然科学基金

1+阅读 · 2012年12月31日

基于约束的高维数据聚类

国家自然科学基金

2+阅读 · 2012年12月31日

高维数据建模与分析的若干问题

国家自然科学基金

0+阅读 · 2012年12月31日

面向高维数据的稀疏正则化方法及应用

国家自然科学基金

2+阅读 · 2011年12月31日

多媒体数据的几何特征表示与分析

国家自然科学基金

0+阅读 · 2011年12月31日

高维数据统计建模与分析

国家自然科学基金

6+阅读 · 2011年12月31日

Tikhonov Regularization of Circle-Valued Signals

Tikhonov Regularization of Circle-Valued Signals

Arxiv

1+阅读 · 2022年4月20日

The White-Box Adversarial Data Stream Model

Arxiv

0+阅读 · 2022年4月19日

A Re-analysis of Repeatability and Reproducibility in the Ames-USDOE-FBI Study

Arxiv

0+阅读 · 2022年4月19日

Expected $L_2-$discrepancy bound for a class of new stratified sampling models

Arxiv

0+阅读 · 2022年4月19日

High-Dimensional Geometric Streaming in Polynomial Space

Arxiv

0+阅读 · 2022年4月18日

Empirical Evaluation and Theoretical Analysis for Representation Learning: A Survey

Arxiv

0+阅读 · 2022年4月18日

Numerical methods to evaluate Koopman matrix from system equations

Arxiv

0+阅读 · 2022年4月17日

Analytical Benchmark Problems for Multifidelity Optimization Methods

Arxiv

0+阅读 · 2022年4月16日

Statistical-Computational Trade-offs in Tensor PCA and Related Problems via Communication Complexity

Arxiv

0+阅读 · 2022年4月15日

Meta-Learning: A Survey

Arxiv

136+阅读 · 2018年10月8日

阅读: 0 点赞: 0

小贴士

登录享主题订阅及个性化推荐

相关主题

热门VIP内容

开通专知VIP会员享更多权益服务

【MIT博士论文】弱监督学习：理论、方法与应用

Andrej Karpathy：2025 年 LLM 年度回顾（2025 LLM Year in Review）

锚定情报：合成欺骗时代的地面真相

NeurIPS 2025 | NMKE：基于神经元归因与动态稀疏掩码的终身知识编辑

相关VIP内容

【硬核书】用于机器学习和数据挖掘的数学分析，968页pdf

专知会员服务

187+阅读 · 2021年9月3日

图像去噪方法概述

专知会员服务

43+阅读 · 2021年8月30日

算法分析导论, 593页pdf

算法分析导论, 593页pdf

专知会员服务

151+阅读 · 2021年8月30日

【经典书】半监督学习，524页pdf

【经典书】半监督学习，524页pdf

专知会员服务

139+阅读 · 2021年8月20日

【开放书】《矩阵流形优化算法》，241页pdf

【开放书】《矩阵流形优化算法》，241页pdf

专知会员服务

96+阅读 · 2021年7月3日

【ICML2021】数据表示的几何评估

专知会员服务

38+阅读 · 2021年6月3日

【博士论文】基于深度学习的图像处理算法研究

专知会员服务

81+阅读 · 2020年12月6日

复杂网络的双曲空间表征学习方法

专知会员服务

47+阅读 · 2020年11月13日

大规模时间序列分析框架的研究与实现，计算机学报

大规模时间序列分析框架的研究与实现，计算机学报

专知会员服务

59+阅读 · 2020年7月13日

自动结构变分推理，Automatic structured variational inference

自动结构变分推理，Automatic structured variational inference

专知会员服务

41+阅读 · 2020年2月10日

相关资讯

基于机器学习的自动化网络流量分析

基于机器学习的自动化网络流量分析

CCF计算机安全专委会

5+阅读 · 2022年4月8日

神经网络，凉了?

神经网络，凉了?

CVer

2+阅读 · 2022年3月16日

【博士论文】分形计算系统

【博士论文】分形计算系统

专知

3+阅读 · 2021年12月9日

【伯克利马毅老师等重磅新书】低维模型进行高维数据分析:原理、计算和应用，710页pdf

【伯克利马毅老师等重磅新书】低维模型进行高维数据分析:原理、计算和应用，710页pdf

专知

45+阅读 · 2020年12月9日

常见的距离算法和相似度计算方法

常见的距离算法和相似度计算方法

极市平台

18+阅读 · 2020年7月31日

博客 | 度量学习总结(二) | 如何使用度量学习处理高维数据？

博客 | 度量学习总结(二) | 如何使用度量学习处理高维数据？

AI研习社

20+阅读 · 2019年3月26日

深度丨顾险峰：深度学习的几何观点——流形分布定律

深度丨顾险峰：深度学习的几何观点——流形分布定律

德先生

17+阅读 · 2018年6月11日

谷歌实习生提出tSNE在大型高维数据集上实时可视化的方法（附代码）

谷歌实习生提出tSNE在大型高维数据集上实时可视化的方法（附代码）

论智

13+阅读 · 2018年6月8日

【干货】深入理解变分自编码器

【干货】深入理解变分自编码器

专知

21+阅读 · 2018年3月22日

手把手教你用LDA特征选择

手把手教你用LDA特征选择

AI研习社

12+阅读 · 2017年8月21日

相关基金

超高维数据中若干检验问题的研究

国家自然科学基金

0+阅读 · 2015年12月31日

大数据中的广义稀疏几何结构学习方法研究

国家自然科学基金

2+阅读 · 2013年12月31日

鲁棒几何结构描述及图像识别

国家自然科学基金

1+阅读 · 2012年12月31日

矢量数学形态学理论及其在高维数据处理中的应用

国家自然科学基金

0+阅读 · 2012年12月31日

基于几何计算的可视媒体数据特征提取方法研究

国家自然科学基金

1+阅读 · 2012年12月31日

基于约束的高维数据聚类

国家自然科学基金

2+阅读 · 2012年12月31日

高维数据建模与分析的若干问题

国家自然科学基金

0+阅读 · 2012年12月31日

面向高维数据的稀疏正则化方法及应用

国家自然科学基金

2+阅读 · 2011年12月31日

多媒体数据的几何特征表示与分析

国家自然科学基金

0+阅读 · 2011年12月31日

高维数据统计建模与分析

国家自然科学基金

6+阅读 · 2011年12月31日

相关论文

Tikhonov Regularization of Circle-Valued Signals

Tikhonov Regularization of Circle-Valued Signals

Arxiv

1+阅读 · 2022年4月20日

The White-Box Adversarial Data Stream Model

Arxiv

0+阅读 · 2022年4月19日

A Re-analysis of Repeatability and Reproducibility in the Ames-USDOE-FBI Study

Arxiv

0+阅读 · 2022年4月19日

Expected $L_2-$discrepancy bound for a class of new stratified sampling models

Arxiv

0+阅读 · 2022年4月19日

High-Dimensional Geometric Streaming in Polynomial Space

Arxiv

0+阅读 · 2022年4月18日

Empirical Evaluation and Theoretical Analysis for Representation Learning: A Survey

Arxiv

0+阅读 · 2022年4月18日

Numerical methods to evaluate Koopman matrix from system equations

Arxiv

0+阅读 · 2022年4月17日

Analytical Benchmark Problems for Multifidelity Optimization Methods

Arxiv

0+阅读 · 2022年4月16日

Statistical-Computational Trade-offs in Tensor PCA and Related Problems via Communication Complexity

Arxiv

0+阅读 · 2022年4月15日

Meta-Learning: A Survey

Arxiv

136+阅读 · 2018年10月8日

微信扫码咨询专知VIP会员