您的传播模型秘密地知道数据多维的维度 (Your diffusion model secretly knows the dimension of the data manifold) - 专知论文

会员服务 ·

0

流形 · MoDELS · 估计/估计量 · 有向 · 近似 ·

2022 年 12 月 23 日

Your diffusion model secretly knows the dimension of the data manifold

翻译：您的传播模型秘密地知道数据多维的维度

Georgios Batzolis,Jan Stanczuk,Carola-Bibiane Schönlieb

from arxiv, arXiv admin note: text overlap with arXiv:2207.09786

In this work, we propose a novel framework for estimating the dimension of the data manifold using a trained diffusion model. A trained diffusion model approximates the gradient of the log density of a noise-corrupted version of the target distribution for varying levels of corruption. If the data concentrates around a manifold embedded in the high-dimensional ambient space, then as the level of corruption decreases, the score function points towards the manifold, as this direction becomes the direction of maximum likelihood increase. Therefore, for small levels of corruption, the diffusion model provides us with access to an approximation of the normal bundle of the data manifold. This allows us to estimate the dimension of the tangent space, thus, the intrinsic dimension of the data manifold. Our method outperforms linear methods for dimensionality detection such as PPCA in controlled experiments.

翻译：在这项工作中,我们提出了一个新的框架,用经过培训的传播模型来估计数据元的维度。经过培训的传播模型接近了不同腐败程度的目标分布的噪音破坏版本的原木密度梯度。如果数据集中在高维环境空间内嵌入的多元体周围,那么随着腐败程度的下降,分数函数指向多元体,因为这个方向成为最大可能性增加的方向。因此,对于小的腐败程度,扩散模型使我们有机会接近数据元的正常捆绑。这使我们能够估计相干空间的维度,从而估计数据元的内在维度。我们的方法超越了在受控制的实验中进行维度探测的线性方法,如PPCA。

0

相关内容

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

ICLR 2021杰出论文奖出炉，8篇论文上榜！

专知会员服务

26+阅读 · 2021年4月2日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

社交网络上议题社群的公共焦虑研究，中国人民大学新闻学院塔娜讲师，第八届全国社会媒体处理大会SMP2019

社交网络上议题社群的公共焦虑研究，中国人民大学新闻学院塔娜讲师，第八届全国社会媒体处理大会SMP2019

专知会员服务

15+阅读 · 2019年10月23日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Workshop

【ICIG2021】Latest News & Announcements of the Workshop

中国图象图形学学会CSIG

0+阅读 · 2021年12月20日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium5

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium5

中国图象图形学学会CSIG

1+阅读 · 2021年11月11日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

中国图象图形学学会CSIG

0+阅读 · 2021年11月10日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

中国图象图形学学会CSIG

0+阅读 · 2021年11月8日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

神经系统seipin缺失诱发精神迟滞的分子机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

肿瘤抗原HCA587与STAT3的相互作用及其促进肿瘤转移的分子机制研究

国家自然科学基金

1+阅读 · 2014年12月31日

长链非编码RNA CAR intergenic 10在细胞衰老中的作用和机制

国家自然科学基金

1+阅读 · 2013年12月31日

混凝土Weibull统计尺寸效应理论模型改进研究

国家自然科学基金

0+阅读 · 2013年12月31日

沉淀合金中反位缺陷演化的微观相场模拟

国家自然科学基金

0+阅读 · 2013年12月31日

CuInGaSe2太阳能电池界面结构、界面态及其钝化

国家自然科学基金

0+阅读 · 2012年12月31日

长链非编码RNA AK055007靶定ILF3对肝癌调控机制的研究

国家自然科学基金

0+阅读 · 2011年12月31日

与玻色-爱因斯坦凝聚相关的确定与不确定系统孤立子的动力学行为

国家自然科学基金

0+阅读 · 2009年12月31日

HIF-1αAPK通路在微波辐射致海马线粒体损伤中的作用研究

国家自然科学基金

0+阅读 · 2009年12月31日

RTN3对血管内皮细胞凋亡与粘附的双调控及其机制研究

国家自然科学基金

0+阅读 · 2008年12月31日

$k$-Center Clustering with Outliers in the MPC and Streaming Model

Arxiv

0+阅读 · 2023年2月24日

Neural Network Approximation of Continuous Functions in High Dimensions with Applications to Inverse Problems

Arxiv

0+阅读 · 2023年2月24日

intRinsic: an R Package for Model-Based Estimation of the Intrinsic Dimension of a Dataset

Arxiv

0+阅读 · 2023年2月23日

High-dimensional limit theorems for SGD: Effective dynamics and critical scaling

Arxiv

0+阅读 · 2023年2月23日

An information-theoretic learning model based on importance sampling

Arxiv

0+阅读 · 2023年2月23日

Matrix Multiplication and Number On the Forehead Communication

Arxiv

0+阅读 · 2023年2月22日

Structured Denoising Diffusion Models in Discrete State-Spaces

Arxiv

0+阅读 · 2023年2月22日

Estimating Total Correlation with Mutual Information Estimators

Arxiv

0+阅读 · 2023年2月22日

Time Series Clustering with an EM algorithm for Mixtures of Linear Gaussian State Space Models

Arxiv

0+阅读 · 2023年2月22日

Diffusion Models in Vision: A Survey

Arxiv

30+阅读 · 2022年9月10日

VIP会员

文章信息

相关主题

估计/估计量

相关VIP内容

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

ICLR 2021杰出论文奖出炉，8篇论文上榜！

专知会员服务

26+阅读 · 2021年4月2日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

图像分类技巧集，17页ppt《Bag of Tricks for Image Classification》

专知会员服务

96+阅读 · 2020年3月12日

社交网络上议题社群的公共焦虑研究，中国人民大学新闻学院塔娜讲师，第八届全国社会媒体处理大会SMP2019

社交网络上议题社群的公共焦虑研究，中国人民大学新闻学院塔娜讲师，第八届全国社会媒体处理大会SMP2019

专知会员服务

15+阅读 · 2019年10月23日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《俄乌战争中的无人系统：新的战争方式与新兴趋势——来自前线的印象》报告

《海上自主水面船舶远程操作中心：安全可持续运行的多维度分析》

多模态大语言模型下游调优中“保持自我”的重要性

隐身自主无人水下航行器技术如何变革水下作战并重塑海军竞争

相关资讯

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Workshop

【ICIG2021】Latest News & Announcements of the Workshop

中国图象图形学学会CSIG

0+阅读 · 2021年12月20日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium5

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium5

中国图象图形学学会CSIG

1+阅读 · 2021年11月11日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

中国图象图形学学会CSIG

0+阅读 · 2021年11月10日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

中国图象图形学学会CSIG

0+阅读 · 2021年11月8日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

相关论文

$k$-Center Clustering with Outliers in the MPC and Streaming Model

Arxiv

0+阅读 · 2023年2月24日

Neural Network Approximation of Continuous Functions in High Dimensions with Applications to Inverse Problems

Arxiv

0+阅读 · 2023年2月24日

intRinsic: an R Package for Model-Based Estimation of the Intrinsic Dimension of a Dataset

Arxiv

0+阅读 · 2023年2月23日

High-dimensional limit theorems for SGD: Effective dynamics and critical scaling

Arxiv

0+阅读 · 2023年2月23日

An information-theoretic learning model based on importance sampling

Arxiv

0+阅读 · 2023年2月23日

Matrix Multiplication and Number On the Forehead Communication

Arxiv

0+阅读 · 2023年2月22日

Structured Denoising Diffusion Models in Discrete State-Spaces

Arxiv

0+阅读 · 2023年2月22日

Estimating Total Correlation with Mutual Information Estimators

Arxiv

0+阅读 · 2023年2月22日

Time Series Clustering with an EM algorithm for Mixtures of Linear Gaussian State Space Models

Arxiv

0+阅读 · 2023年2月22日

Diffusion Models in Vision: A Survey

Arxiv

30+阅读 · 2022年9月10日

相关基金

神经系统seipin缺失诱发精神迟滞的分子机制研究

国家自然科学基金

0+阅读 · 2014年12月31日

肿瘤抗原HCA587与STAT3的相互作用及其促进肿瘤转移的分子机制研究

国家自然科学基金

1+阅读 · 2014年12月31日

长链非编码RNA CAR intergenic 10在细胞衰老中的作用和机制

国家自然科学基金

1+阅读 · 2013年12月31日

混凝土Weibull统计尺寸效应理论模型改进研究

国家自然科学基金

0+阅读 · 2013年12月31日

沉淀合金中反位缺陷演化的微观相场模拟

国家自然科学基金

0+阅读 · 2013年12月31日

CuInGaSe2太阳能电池界面结构、界面态及其钝化

国家自然科学基金

0+阅读 · 2012年12月31日

长链非编码RNA AK055007靶定ILF3对肝癌调控机制的研究

国家自然科学基金

0+阅读 · 2011年12月31日

与玻色-爱因斯坦凝聚相关的确定与不确定系统孤立子的动力学行为

国家自然科学基金

0+阅读 · 2009年12月31日

HIF-1αAPK通路在微波辐射致海马线粒体损伤中的作用研究

国家自然科学基金

0+阅读 · 2009年12月31日

RTN3对血管内皮细胞凋亡与粘附的双调控及其机制研究

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员