FusedMM: A Unified SDDMM-SpMM Kernel for Graph Embedding and Graph Neural Networks


October 27, 2021


Md. Khaledur Rahman, Majedul Haque Sujon, Ariful Azad

From arXiv, 11 pages, published in IEEE IPDPS 2021

We develop a fused matrix multiplication kernel that unifies sampled dense-dense matrix multiplication and sparse-dense matrix multiplication under a single operation called FusedMM. By using user-defined functions, FusedMM can capture almost all computational patterns needed by popular graph embedding and GNN approaches. FusedMM is an order of magnitude faster than its equivalent kernels in Deep Graph Library. The superior performance of FusedMM comes from the low-level vectorized kernels, a suitable load balancing scheme and an efficient utilization of the memory bandwidth. FusedMM can tune its performance using a code generator and perform equally well on Intel, AMD and ARM processors. FusedMM speeds up an end-to-end graph embedding algorithm by up to 28x on different processors.
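To make the idea of fusing the two operations concrete, here is a minimal pure-Python sketch, not the authors' implementation. It assumes the adjacency matrix is stored in CSR form (`indptr`, `indices`, `data`) and that the user-defined function is a single per-edge callback. The names `fused_mm`, `unfused_mm`, and `edge_op` are hypothetical and chosen only for illustration; the actual FusedMM kernels are vectorized low-level code with their own interface.

```python
import math


def dot(u, v):
    """Dot product of two equal-length feature vectors."""
    return sum(a * b for a, b in zip(u, v))


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def fused_mm(indptr, indices, data, X, Y, edge_op=lambda a, s: a * s):
    """Fused SDDMM + SpMM over a CSR matrix A (hypothetical interface).

    For every nonzero A[i, j]: compute s = <X[i], Y[j]> (the SDDMM step),
    map it to an edge weight w = edge_op(A[i, j], s), and immediately
    accumulate w * Y[j] into Z[i] (the SpMM step). The intermediate
    sparse matrix of edge weights is never stored.
    """
    n, d = len(indptr) - 1, len(Y[0])
    Z = [[0.0] * d for _ in range(n)]
    for i in range(n):
        for k in range(indptr[i], indptr[i + 1]):
            yj = Y[indices[k]]
            w = edge_op(data[k], dot(X[i], yj))
            for t in range(d):
                Z[i][t] += w * yj[t]
    return Z


def unfused_mm(indptr, indices, data, X, Y, edge_op=lambda a, s: a * s):
    """Reference version: materialize the SDDMM output, then run SpMM."""
    n, d = len(indptr) - 1, len(Y[0])
    # SDDMM: one weight per stored nonzero, in CSR order,
    # so weights[k] lines up with indices[k].
    weights = [
        edge_op(data[k], dot(X[i], Y[indices[k]]))
        for i in range(n)
        for k in range(indptr[i], indptr[i + 1])
    ]
    # SpMM: multiply the weighted sparse matrix by the dense matrix Y.
    Z = [[0.0] * d for _ in range(n)]
    for i in range(n):
        for k in range(indptr[i], indptr[i + 1]):
            for t in range(d):
                Z[i][t] += weights[k] * Y[indices[k]][t]
    return Z


if __name__ == "__main__":
    # Toy 3-node star graph: node 0 is connected to nodes 1 and 2.
    indptr, indices, data = [0, 2, 3, 4], [1, 2, 0, 0], [1.0, 1.0, 1.0, 1.0]
    X = [[1, 0], [0, 1], [1, 1]]
    print(fused_mm(indptr, indices, data, X, X))
    # [[1.0, 1.0], [0.0, 0.0], [1.0, 0.0]]
```

Passing a different `edge_op`, for example `lambda a, s: a * sigmoid(s)`, changes the per-edge computation without touching the loop structure, which is the role the abstract assigns to user-defined functions. Both functions return the same result; the fused one simply avoids allocating and re-reading the per-edge weights.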

