DIFFormer: Scalable (Graph) Transformers Induced by Energy Constrained Diffusion - 专知 (Zhuanzhi) Papers


DIFFormer · Diffusion process · Constraint · Structure · Transformer

April 4, 2023

DIFFormer: Scalable (Graph) Transformers Induced by Energy Constrained Diffusion


Qitian Wu, Chenxiao Yang, Wentao Zhao, Yixuan He, David Wipf, Junchi Yan

From arXiv. Published at ICLR 2023 as a spotlight presentation. The implementation code is available at https://github.com/qitianwu/DIFFormer

Real-world data generation often involves complex inter-dependencies among instances, violating the IID-data hypothesis of standard learning paradigms and posing a challenge for uncovering the geometric structures for learning desired instance representations. To this end, we introduce an energy constrained diffusion model which encodes a batch of instances from a dataset into evolutionary states that progressively incorporate other instances' information by their interactions. The diffusion process is constrained by descent criteria with respect to a principled energy function that characterizes the global consistency of instance representations over latent structures. We provide rigorous theory that implies closed-form optimal estimates for the pairwise diffusion strength among arbitrary instance pairs, which gives rise to a new class of neural encoders, dubbed DIFFormer (diffusion-based Transformers), with two instantiations: a simple version with linear complexity for prohibitive instance numbers, and an advanced version for learning complex structures. Experiments highlight the wide applicability of our model as a general-purpose encoder backbone with superior performance in various tasks, such as node classification on large graphs, semi-supervised image/text classification, and spatial-temporal dynamics prediction.
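To make the idea of all-pair diffusion with linear complexity more concrete, here is a minimal NumPy sketch. It is not the authors' implementation, and the abstract does not state the exact formulas. The similarity `1 + cos(z_i, z_j)`, the step size `tau`, and the function names are all assumptions chosen for illustration. The sketch shows one explicit diffusion step in which every instance moves toward a similarity-weighted average of all instances, computed without materializing the N × N matrix, alongside a dense reference version.

```python
import numpy as np


def l2_normalize(x, eps=1e-12):
    """Scale each row to (approximately) unit Euclidean norm."""
    return x / (np.linalg.norm(x, axis=-1, keepdims=True) + eps)


def diffusion_step_linear(z, tau=0.5):
    """One explicit diffusion step over N instances in O(N * d^2) time.

    Assumed (hypothetical) diffusion strength: s_ij = 1 + cos(z_i, z_j),
    normalized over j. Because s_ij is a sum of a constant and an inner
    product, the weighted sum over j can be factorized.
    """
    n = z.shape[0]
    zn = l2_normalize(z)
    # sum_j s_ij * z_j = sum_j z_j + zn_i @ (zn^T z)
    numer = z.sum(axis=0, keepdims=True) + zn @ (zn.T @ z)  # shape (n, d)
    # sum_j s_ij = n + zn_i . (sum_j zn_j)
    denom = n + zn @ zn.sum(axis=0)                          # shape (n,)
    propagated = numer / denom[:, None]
    # Explicit Euler update: mix each state with its propagated neighbors.
    return (1.0 - tau) * z + tau * propagated


def diffusion_step_dense(z, tau=0.5):
    """Reference version that builds the full N x N matrix (O(N^2 * d))."""
    zn = l2_normalize(z)
    s = 1.0 + zn @ zn.T
    s = s / s.sum(axis=1, keepdims=True)
    return (1.0 - tau) * z + tau * (s @ z)
```

Under these assumptions the two functions return the same result up to floating-point error; the factorized version only ever forms a d × d intermediate, which is what keeps its cost linear in the number of instances.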


