Boltzmann 配对互动旋转系统配制的 Boltzmann 分布的自动递退神经网络结构 (The autoregressive neural network architecture of the Boltzmann distribution of pairwise interacting spins systems) - 专知论文

会员服务 ·

0

INTERACT · 成对型 · Networking · Neural Networks · MoDELS ·

2023 年 2 月 16 日

The autoregressive neural network architecture of the Boltzmann distribution of pairwise interacting spins systems

翻译：Boltzmann 配对互动旋转系统配制的 Boltzmann 分布的自动递退神经网络结构

from arxiv, 10 pages, 6 figure plus the Supplementary Information

Generative Autoregressive Neural Networks (ARNN) have recently demonstrated exceptional results in image and language generation tasks, contributing to the growing popularity of generative models in both scientific and commercial applications. This work presents a physical interpretation of the ARNNs by reformulating the Boltzmann distribution of binary pairwise interacting systems into autoregressive form. The resulting ARNN architecture has weights and biases of its first layer corresponding to the Hamiltonian's couplings and external fields, featuring widely used structures like the residual connections and a recurrent architecture with clear physical meanings. However, the exponential growth, with system size, of the number of parameters of the hidden layers makes its direct application unfeasible. Nevertheless, its architecture's explicit formulation allows using statistical physics techniques to derive new ARNNs for specific systems. As examples, new effective ARNN architectures are derived from two well-known mean-field systems, the Curie-Weiss and Sherrington-Kirkpatrick models, showing superior performances in approximating the Boltzmann distributions of the corresponding physics model than other commonly used ARNNs architectures. The connection established between the physics of the system and the ARNN architecture provides a way to derive new neural network architectures for different interacting systems and interpret existing ones from a physical perspective.

翻译：生成自动递增神经网络(ARNN)最近在图像和语言生成任务方面展示出非同寻常的结果,有助于在科学和商业应用中日益普及基因模型。这项工作通过将布尔茨曼双向互动系统的二进制配对互动系统重新配制成自动递减形式,对ARNN作了物理解释。由此形成的ARNN结构的第一层具有与汉密尔顿的组合和外部领域相对应的分量和偏差,其第一层与汉密尔顿的组合和外部领域相对应,其特点是广泛使用的结构,如残余连接和具有明确物理意义的经常结构。然而,随着系统规模的指数增长,隐藏层参数数量的指数增长使得其直接应用变得不可行。然而,其结构的清晰配置允许使用统计物理技术为特定系统开发新的ARNNNS。例如,新的有效的ARNN结构源自两个众所周知的中位系统,即Curie-Weiss和Sherrington-Kirkpatrick模型,在接近波尔茨曼的物理模型分布上表现出优异性的表现,而不是通常使用的ARNNIS结构,为ARN的新的结构提供不同的连接。

0

相关内容

INTERACT

IFIP TC13 Conference on Human-Computer Interaction是人机交互领域的研究者和实践者展示其工作的重要平台。多年来，这些会议吸引了来自几个国家和文化的研究人员。官网链接：http://interact2019.org/

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

75+阅读 · 2022年6月28日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

深度学习优化算法，73页ppt，Optimization Algorithms on Deep Learning

深度学习优化算法，73页ppt，Optimization Algorithms on Deep Learning

专知会员服务

135+阅读 · 2021年6月16日

【ACML2020】张量网络机器学习:最近的进展和前沿，109页ppt

【ACML2020】张量网络机器学习:最近的进展和前沿，109页ppt

专知会员服务

55+阅读 · 2020年12月15日

【深度学习架构、模型和技巧集合(TensorFlow/PyTorch)】’Deep Learning Models - A collection of various deep learning architectures, models, and tips'

【深度学习架构、模型和技巧集合(TensorFlow/PyTorch)】’Deep Learning Models - A collection of various deep learning architectures, models, and tips'

专知会员服务

58+阅读 · 2020年1月25日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

图与推荐

2+阅读 · 2022年11月2日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

深度自进化聚类：Deep Self-Evolution Clustering

深度自进化聚类：Deep Self-Evolution Clustering

我爱读PAMI

15+阅读 · 2019年4月13日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

【代码资源】GAN | 七份最热GAN文章及代码分享（Github 1000+Stars）

【代码资源】GAN | 七份最热GAN文章及代码分享（Github 1000+Stars）

专知

13+阅读 · 2018年6月24日

Capsule Networks解析

Capsule Networks解析

机器学习研究会

11+阅读 · 2017年11月12日

【推荐】深度学习目标检测概览

【推荐】深度学习目标检测概览

机器学习研究会

10+阅读 · 2017年9月1日

骨髓源性巨噬细胞microRNA-155对动脉粥样硬化的调控机制

国家自然科学基金

0+阅读 · 2016年12月31日

超小超顺磁性氧化铁标记树突状细胞对兔动脉粥样硬化的磁共振成像研究

国家自然科学基金

0+阅读 · 2014年12月31日

VSTM1调控单核/巨噬细胞功能及动脉粥样硬化发生发展的体内外研究

国家自然科学基金

0+阅读 · 2014年12月31日

Vlasov-Poisson-Boltzmann方程研究

国家自然科学基金

0+阅读 · 2013年12月31日

马尾松高抗旱家系应答干旱胁迫的分子机理

国家自然科学基金

0+阅读 · 2012年12月31日

引力势的量子修正

国家自然科学基金

0+阅读 · 2012年12月31日

考虑表/界面效应的微纳米压电声子晶体波传播特性研究

国家自然科学基金

0+阅读 · 2012年12月31日

Tecto调节非洲爪蛙胚层决定与分化的机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

巨噬细胞MsrA的靶向调控对动脉粥样硬化的干预研究

国家自然科学基金

0+阅读 · 2009年12月31日

旋转玻色爱因斯坦凝聚体的涡旋态理论研究

国家自然科学基金

0+阅读 · 2009年12月31日

Fundamental limits and algorithms for sparse linear regression with sublinear sparsity

Arxiv

0+阅读 · 2023年4月8日

Doubly Stochastic Matrix Models for Estimation of Distribution Algorithms

Arxiv

0+阅读 · 2023年4月5日

Distributed Logistic Regression for Massive Data with Rare Events

Arxiv

0+阅读 · 2023年4月5日

Neural Networks for Extreme Quantile Regression with an Application to Forecasting of Flood Risk

Arxiv

0+阅读 · 2023年4月4日

Distribution-Free Location-Scale Regression

Arxiv

0+阅读 · 2023年4月4日

Artificial neural networks and time series of counts: A class of nonlinear INGARCH models

Arxiv

0+阅读 · 2023年4月3日

Model-Agnostic Reachability Analysis on Deep Neural Networks

Arxiv

0+阅读 · 2023年4月3日

Low-complexity Deep Video Compression with A Distributed Coding Architecture

Arxiv

0+阅读 · 2023年4月2日

Characterizing and overcoming the greedy nature of learning in multi-modal deep neural networks

Arxiv

10+阅读 · 2022年2月10日

Graph Neural Networks: Architectures, Stability and Transferability

Arxiv

13+阅读 · 2020年8月4日

VIP会员

文章信息

相关主题

Neural Networks

相关VIP内容

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

75+阅读 · 2022年6月28日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

深度学习优化算法，73页ppt，Optimization Algorithms on Deep Learning

深度学习优化算法，73页ppt，Optimization Algorithms on Deep Learning

专知会员服务

135+阅读 · 2021年6月16日

【ACML2020】张量网络机器学习:最近的进展和前沿，109页ppt

【ACML2020】张量网络机器学习:最近的进展和前沿，109页ppt

专知会员服务

55+阅读 · 2020年12月15日

【深度学习架构、模型和技巧集合(TensorFlow/PyTorch)】’Deep Learning Models - A collection of various deep learning architectures, models, and tips'

【深度学习架构、模型和技巧集合(TensorFlow/PyTorch)】’Deep Learning Models - A collection of various deep learning architectures, models, and tips'

专知会员服务

58+阅读 · 2020年1月25日

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

Aspect-Oriented Syntax Network for Aspect-Based Sentiment Analysis，中山大学数据科学与计算机学院权小军教授，第八届全国社会媒体处理大会SMP2019

专知会员服务

19+阅读 · 2019年10月22日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

操作系统智能体：基于多模态大模型（MLLM）的通用计算设备智能体综述

《美国太空军系统全生命周期建模、仿真与分析效能提升方案》最新84页报告

【博士论文】推进数据高效的深度学习：非参数 Transformer、主动测试与上下文学习

自主人工智能：未来战争是否将是自主化的？

相关资讯

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

直播 | Interpretable and Trustworthy Graph Geometric Deep Learning

图与推荐

2+阅读 · 2022年11月2日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

深度自进化聚类：Deep Self-Evolution Clustering

深度自进化聚类：Deep Self-Evolution Clustering

我爱读PAMI

15+阅读 · 2019年4月13日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

【代码资源】GAN | 七份最热GAN文章及代码分享（Github 1000+Stars）

【代码资源】GAN | 七份最热GAN文章及代码分享（Github 1000+Stars）

专知

13+阅读 · 2018年6月24日

Capsule Networks解析

Capsule Networks解析

机器学习研究会

11+阅读 · 2017年11月12日

【推荐】深度学习目标检测概览

【推荐】深度学习目标检测概览

机器学习研究会

10+阅读 · 2017年9月1日

相关论文

Fundamental limits and algorithms for sparse linear regression with sublinear sparsity

Arxiv

0+阅读 · 2023年4月8日

Doubly Stochastic Matrix Models for Estimation of Distribution Algorithms

Arxiv

0+阅读 · 2023年4月5日

Distributed Logistic Regression for Massive Data with Rare Events

Arxiv

0+阅读 · 2023年4月5日

Neural Networks for Extreme Quantile Regression with an Application to Forecasting of Flood Risk

Arxiv

0+阅读 · 2023年4月4日

Distribution-Free Location-Scale Regression

Arxiv

0+阅读 · 2023年4月4日

Artificial neural networks and time series of counts: A class of nonlinear INGARCH models

Arxiv

0+阅读 · 2023年4月3日

Model-Agnostic Reachability Analysis on Deep Neural Networks

Arxiv

0+阅读 · 2023年4月3日

Low-complexity Deep Video Compression with A Distributed Coding Architecture

Arxiv

0+阅读 · 2023年4月2日

Characterizing and overcoming the greedy nature of learning in multi-modal deep neural networks

Arxiv

10+阅读 · 2022年2月10日

Graph Neural Networks: Architectures, Stability and Transferability

Arxiv

13+阅读 · 2020年8月4日

相关基金

骨髓源性巨噬细胞microRNA-155对动脉粥样硬化的调控机制

国家自然科学基金

0+阅读 · 2016年12月31日

超小超顺磁性氧化铁标记树突状细胞对兔动脉粥样硬化的磁共振成像研究

国家自然科学基金

0+阅读 · 2014年12月31日

VSTM1调控单核/巨噬细胞功能及动脉粥样硬化发生发展的体内外研究

国家自然科学基金

0+阅读 · 2014年12月31日

Vlasov-Poisson-Boltzmann方程研究

国家自然科学基金

0+阅读 · 2013年12月31日

马尾松高抗旱家系应答干旱胁迫的分子机理

国家自然科学基金

0+阅读 · 2012年12月31日

引力势的量子修正

国家自然科学基金

0+阅读 · 2012年12月31日

考虑表/界面效应的微纳米压电声子晶体波传播特性研究

国家自然科学基金

0+阅读 · 2012年12月31日

Tecto调节非洲爪蛙胚层决定与分化的机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

巨噬细胞MsrA的靶向调控对动脉粥样硬化的干预研究

国家自然科学基金

0+阅读 · 2009年12月31日

旋转玻色爱因斯坦凝聚体的涡旋态理论研究

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员