母体产品国神经相向核心:聚合与应用 (Neural Tangent Kernel of Matrix Product States: Convergence and Applications) - 专知论文

会员服务 ·

0

矩阵乘积 · 核化 · Mercer 核 · 泛函 · 均方误差 ·

2021 年 11 月 28 日

Neural Tangent Kernel of Matrix Product States: Convergence and Applications

翻译：母体产品国神经相向核心:聚合与应用

Erdong Guo,David Draper

from arxiv, 19 pages, 1 figure

In this work, we study the Neural Tangent Kernel (NTK) of Matrix Product States (MPS) and the convergence of its NTK in the infinite bond dimensional limit. We prove that the NTK of MPS asymptotically converges to a constant matrix during the gradient descent (training) process (and also the initialization phase) as the bond dimensions of MPS go to infinity by the observation that the variation of the tensors in MPS asymptotically goes to zero during training in the infinite limit. By showing the positive-definiteness of the NTK of MPS, the convergence of MPS during the training in the function space (space of functions represented by MPS) is guaranteed without any extra assumptions of the data set. We then consider the settings of (supervised) Regression with Mean Square Error (RMSE) and (unsupervised) Born Machines (BM) and analyze their dynamics in the infinite bond dimensional limit. The ordinary differential equations (ODEs) which describe the dynamics of the responses of MPS in the RMSE and BM are derived and solved in the closed-form. For the Regression, we consider Mercer Kernels (Gaussian Kernels) and find that the evolution of the mean of the responses of MPS follows the largest eigenvalue of the NTK. Due to the orthogonality of the kernel functions in BM, the evolution of different modes (samples) decouples and the "characteristic time" of convergence in training is obtained.

翻译：在这项工作中,我们研究了MIT产品国(MPS)的Neal Tangent Kernel(NTK)及其NTK在无限债券维度限制中的趋同性。我们证明MPS的NTK在梯度下降(培训)过程(以及初始阶段)中几乎会与一个恒定矩阵相融合,因为人们发现MPS的变异性在无限债券维度的训练期间将MPS的变异性逐渐变为零。通过显示MPS的NTK的正确定性,MPS在功能空间(MPS所代表的功能空间)培训期间的趋同性会得到保证,而没有额外的数据集假设。我们随后会考虑(SMSE)和(不受监督的)原始机器(BM)的反向性,并分析其在无限债券维度限制中的动态。普通差异方程式(ODS)描述了MPS的动态,而MPS在功能空间(MPS所代表的功能空间空间(MPS)的趋同级变异性(KER的变异性)的变异性功能是“我们KREG的变式”的变式。我们的CRECRA的变的变的变和CRBMA的变的变的变的变和变。我们的变的变的变和变的变的变和变的变的变。

0

相关内容

矩阵乘积

【硬核书】矩阵代数基础，248页pdf

【硬核书】矩阵代数基础，248页pdf

专知会员服务

88+阅读 · 2021年12月9日

深度概率图模型，Deep Probabilistic Models

专知会员服务

29+阅读 · 2021年8月2日

【经典书】线性代数，436页pdf

专知会员服务

78+阅读 · 2021年3月16日

神经常微分方程教程，50页ppt，A brief tutorial on Neural ODEs

神经常微分方程教程，50页ppt，A brief tutorial on Neural ODEs

专知会员服务

74+阅读 · 2020年8月2日

Fariz Darari简明《博弈论Game Theory》介绍，35页ppt

Fariz Darari简明《博弈论Game Theory》介绍，35页ppt

专知会员服务

111+阅读 · 2020年5月15日

来自Fariz Darari博士的一份简明《神经网络与深度学习》的讲义，64页ppt

来自Fariz Darari博士的一份简明《神经网络与深度学习》的讲义，64页ppt

专知会员服务

92+阅读 · 2020年5月5日

UC.Berkeley CS189讲义教材:《机器学习全面指南》，185页pdf

专知会员服务

162+阅读 · 2020年1月16日

Risk Sensitive Portfolio Optimization with Regime-Switching and Default Contagion，香港理工大学应用数学系余翔助理教授，第八届全国社会媒体处理大会SMP2019

Risk Sensitive Portfolio Optimization with Regime-Switching and Default Contagion，香港理工大学应用数学系余翔助理教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

10+阅读 · 2019年10月24日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

【斯坦福CS205L新课程】聚焦机器学习的连续数学方法

【斯坦福CS205L新课程】聚焦机器学习的连续数学方法

专知

3+阅读 · 2019年12月8日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Ray RLlib: Scalable 降龙十八掌

Ray RLlib: Scalable 降龙十八掌

CreateAMind

9+阅读 · 2018年12月28日

Disentangled的假设的探讨

Disentangled的假设的探讨

CreateAMind

9+阅读 · 2018年12月10日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

神经网络学习率设置

神经网络学习率设置

机器学习研究会

4+阅读 · 2018年3月3日

carla 体验效果及代码

carla 体验效果及代码

CreateAMind

7+阅读 · 2018年2月3日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

【学习】Hierarchical Softmax

【学习】Hierarchical Softmax

机器学习研究会

4+阅读 · 2017年8月6日

强化学习 cartpole_a3c

强化学习 cartpole_a3c

CreateAMind

9+阅读 · 2017年7月21日

Score Matched Neural Exponential Families for Likelihood-Free Inference

Arxiv

0+阅读 · 2022年1月31日

On a linearization of quadratic Wasserstein distance

Arxiv

0+阅读 · 2022年1月31日

Saddle-to-Saddle Dynamics in Deep Linear Networks: Small Initialization Training, Symmetry, and Sparsity

Arxiv

0+阅读 · 2022年1月31日

Resultant Tools for Parametric Polynomial Systems with Application to Population Models

Arxiv

0+阅读 · 2022年1月31日

Low-Rank Updates of Matrix Square Roots

Arxiv

0+阅读 · 2022年1月31日

On the Global Convergence of Particle Swarm Optimization Methods

Arxiv

0+阅读 · 2022年1月29日

Generalized statistics: applications to data inverse problems with outlier-resistance

Arxiv

0+阅读 · 2022年1月28日

Certified dimension reduction in nonlinear Bayesian inverse problems

Arxiv

0+阅读 · 2022年1月28日

Matrix Decomposition and Applications

Arxiv

54+阅读 · 2022年1月1日

Products of Euclidean metrics and applications to proximity questions among curves

Arxiv

3+阅读 · 2020年4月13日

VIP会员

文章信息

相关主题

相关VIP内容

【硬核书】矩阵代数基础，248页pdf

【硬核书】矩阵代数基础，248页pdf

专知会员服务

88+阅读 · 2021年12月9日

深度概率图模型，Deep Probabilistic Models

专知会员服务

29+阅读 · 2021年8月2日

【经典书】线性代数，436页pdf

专知会员服务

78+阅读 · 2021年3月16日

神经常微分方程教程，50页ppt，A brief tutorial on Neural ODEs

神经常微分方程教程，50页ppt，A brief tutorial on Neural ODEs

专知会员服务

74+阅读 · 2020年8月2日

Fariz Darari简明《博弈论Game Theory》介绍，35页ppt

Fariz Darari简明《博弈论Game Theory》介绍，35页ppt

专知会员服务

111+阅读 · 2020年5月15日

来自Fariz Darari博士的一份简明《神经网络与深度学习》的讲义，64页ppt

来自Fariz Darari博士的一份简明《神经网络与深度学习》的讲义，64页ppt

专知会员服务

92+阅读 · 2020年5月5日

UC.Berkeley CS189讲义教材:《机器学习全面指南》，185页pdf

专知会员服务

162+阅读 · 2020年1月16日

Risk Sensitive Portfolio Optimization with Regime-Switching and Default Contagion，香港理工大学应用数学系余翔助理教授，第八届全国社会媒体处理大会SMP2019

Risk Sensitive Portfolio Optimization with Regime-Switching and Default Contagion，香港理工大学应用数学系余翔助理教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

10+阅读 · 2019年10月24日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

49+阅读 · 2019年10月17日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

热门VIP内容

开通专知VIP会员享更多权益服务

《美国海军陆战队软件定义网络应用案例：分布式防火墙自动化系统》148页

《多体环境下定位导航授时（PNT）系统研究》228页

软件定义无线电（SDR）：商业与军事领域的技术、应用及未来趋势

《攻势防空作战中无人追击者/规避者最优轨迹研究（含动态交战区建模）》95页

相关资讯

【斯坦福CS205L新课程】聚焦机器学习的连续数学方法

【斯坦福CS205L新课程】聚焦机器学习的连续数学方法

专知

3+阅读 · 2019年12月8日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Ray RLlib: Scalable 降龙十八掌

Ray RLlib: Scalable 降龙十八掌

CreateAMind

9+阅读 · 2018年12月28日

Disentangled的假设的探讨

Disentangled的假设的探讨

CreateAMind

9+阅读 · 2018年12月10日

Hierarchical Disentangled Representations

Hierarchical Disentangled Representations

CreateAMind

4+阅读 · 2018年4月15日

神经网络学习率设置

神经网络学习率设置

机器学习研究会

4+阅读 · 2018年3月3日

carla 体验效果及代码

carla 体验效果及代码

CreateAMind

7+阅读 · 2018年2月3日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

【学习】Hierarchical Softmax

【学习】Hierarchical Softmax

机器学习研究会

4+阅读 · 2017年8月6日

强化学习 cartpole_a3c

强化学习 cartpole_a3c

CreateAMind

9+阅读 · 2017年7月21日

相关论文

Score Matched Neural Exponential Families for Likelihood-Free Inference

Arxiv

0+阅读 · 2022年1月31日

On a linearization of quadratic Wasserstein distance

Arxiv

0+阅读 · 2022年1月31日

Saddle-to-Saddle Dynamics in Deep Linear Networks: Small Initialization Training, Symmetry, and Sparsity

Arxiv

0+阅读 · 2022年1月31日

Resultant Tools for Parametric Polynomial Systems with Application to Population Models

Arxiv

0+阅读 · 2022年1月31日

Low-Rank Updates of Matrix Square Roots

Arxiv

0+阅读 · 2022年1月31日

On the Global Convergence of Particle Swarm Optimization Methods

Arxiv

0+阅读 · 2022年1月29日

Generalized statistics: applications to data inverse problems with outlier-resistance

Arxiv

0+阅读 · 2022年1月28日

Certified dimension reduction in nonlinear Bayesian inverse problems

Arxiv

0+阅读 · 2022年1月28日

Matrix Decomposition and Applications

Arxiv

54+阅读 · 2022年1月1日

Products of Euclidean metrics and applications to proximity questions among curves

Arxiv

3+阅读 · 2020年4月13日

微信扫码咨询专知VIP会员