【O'Reilly TensorFlow World 2019】有益的模块化卷积（Modular convolution considered beneficial），AMD的PMTS软件开发工程师Jack Chung，AMD软件开发人员Chao Liu，AMD软件架构师Daniel Lowell - 专知VIP

会员服务 ·

0

机器学习 · Jack Chung · TensorFlow · 卷积 · AMD ·

2019 年 11 月 14 日

【O'Reilly TensorFlow World 2019】有益的模块化卷积（Modular convolution considered beneficial），AMD的PMTS软件开发工程师Jack Chung，AMD软件开发人员Chao Liu，AMD软件架构师Daniel Lowell

专知会员服务

专知，提供专业可信的知识分发服务，让认知协作更快更好！

报告主题：Modular convolution considered beneficial

报告摘要：

miOpen包含性能关键的GPU内核，在AMD的ROCm平台上驱动机器学习的工作负载。Jack Chung、Chao Liu和Daniel Lowell探索了如何将它们制作成模块，这样它们就可以很容易地为来自AMD的各种GPU硬件进行调整，并与TensorFlow XLA等图形编译器紧密结合。他们展示了各种卷积算法是如何在AMD的硬件上实现的，如何将它们分解成模块，如何被XLA提取和融合，以及如何执行。

邀请嘉宾：

郑文衡(Jack Chung)是AMD的PMTS软件开发工程师，他从ROCm堆栈的早期开始就在那里工作。他有编译器前端、优化传递和高级语言的运行时方面的经验。他的重点是TensorFlow XLA。

Chao Liu是AMD的一名软件开发人员，他在AMD从事开源高性能深度学习库miOpen的工作。他的兴趣包括开发并行算法和各种应用的数值方法，包括深度学习和基于物理的仿真。在此之前，他开发了计算流体动力学、有限元分析、迭代求解和网格生成等技术。

Daniel Lowell是AMD的深度学习GPU内核库miOpen的团队领导和软件架构师。在此之前，他曾在AMD的高性能计算(HPC)领域从事编译技术和可靠性方面的研究。他的兴趣包括深度学习、脑机接口、自动代码生成和高性能计算。

成为VIP会员查看完整内容

Modular convolution considered beneficial Presentation.pdf

3

相关内容

机器学习

“机器学习是近20多年兴起的一门多领域交叉学科，涉及概率论、统计学、逼近论、凸分析、算法复杂度理论等多门学科。机器学习理论主要是设计和分析一些让可以自动“ 学习”的算法。机器学习算法是一类从数据中自动分析获得规律，并利用规律对未知数据进行预测的算法。因为学习算法中涉及了大量的统计学理论，机器学习与统计推断学联系尤为密切，也被称为统计学习理论。算法设计方面，机器学习理论关注可以实现的，行之有效的学习算法。很多推论问题属于无程序可循难度，所以部分的机器学习研究是开发容易处理的近似算法。” ——中文维基百科

知识荟萃

精品入门和进阶教程、论文和代码整理等

更多

查看相关VIP内容、论文、资讯等

如何加速NVIDIA gpu上的训练、推理和ML应用？108页ppt，Accelerating training, inference, and ML applications on NVIDIA GPUs

如何加速NVIDIA gpu上的训练、推理和ML应用？108页ppt，Accelerating training, inference, and ML applications on NVIDIA GPUs

专知会员服务

61+阅读 · 2019年12月29日

【O'Reilly TensorFlow Conference 2019】基于TensorFlow的实时流数据机器学习（Machine learning over real-time streaming data with TensorFlow）

【O'Reilly TensorFlow Conference 2019】基于TensorFlow的实时流数据机器学习（Machine learning over real-time streaming data with TensorFlow）

专知会员服务

28+阅读 · 2019年11月14日

【O'Reilly TensorFlow Conference 2019】TensorFlow，开源和IBM（TensorFlow, open source, and IBM ），IBM | Fred Reiss

【O'Reilly TensorFlow Conference 2019】TensorFlow，开源和IBM（TensorFlow, open source, and IBM ），IBM | Fred Reiss

专知会员服务

11+阅读 · 2019年11月14日

【O'Reilly TensorFlow Conference 2019】TensorFlow社区公告（TensorFlow community announcements），Google TensorFlow产品总监Kemal El Moujahid

【O'Reilly TensorFlow Conference 2019】TensorFlow社区公告（TensorFlow community announcements），Google TensorFlow产品总监Kemal El Moujahid

专知会员服务

6+阅读 · 2019年11月14日

【O'Reilly TensorFlow Conference 2019】TensorFlow.js：用java实现机器学习（TensorFlow.js: Bringing machine learning to JavaScript），Google | Sandeep Gupta，蒙特利尔大学 | Joseph Paul Cohen

【O'Reilly TensorFlow Conference 2019】TensorFlow.js：用java实现机器学习（TensorFlow.js: Bringing machine learning to JavaScript），Google | Sandeep Gupta，蒙特利尔大学 | Joseph Paul Cohen

专知会员服务

7+阅读 · 2019年11月14日

【O'Reilly TensorFlow Conference 2019】恶意软件检测（Generative malware outbreak detection），Sean Park | Trend Micro

【O'Reilly TensorFlow Conference 2019】恶意软件检测（Generative malware outbreak detection），Sean Park | Trend Micro

专知会员服务

15+阅读 · 2019年11月13日

【O'Reilly TensorFlow Conference 2019】使用TensorFlow服务的高级模型部署（Advanced model deployments with TensorFlow Serving），谷歌开发专家Hannes Hapke

【O'Reilly TensorFlow Conference 2019】使用TensorFlow服务的高级模型部署（Advanced model deployments with TensorFlow Serving），谷歌开发专家Hannes Hapke

专知会员服务

23+阅读 · 2019年11月13日

【Amazon AWS】深度学习编译器（Deep Learning Compiler），附35页ppt

【Amazon AWS】深度学习编译器（Deep Learning Compiler），附35页ppt

专知会员服务

43+阅读 · 2019年11月5日

【O'Reilly AI Conference 2019】大规模构建和部署AI应用程序和系统（Building and deploying AI applications and systems at scale），O'Reilly的首席数据科学家Ben Lorica、Computable 联合创始人兼首席执行官Roger Chen

【O'Reilly AI Conference 2019】大规模构建和部署AI应用程序和系统（Building and deploying AI applications and systems at scale），O'Reilly的首席数据科学家Ben Lorica、Computable 联合创始人兼首席执行官Roger Chen

专知会员服务

25+阅读 · 2019年11月5日

【O'Reilly AI Conference 2019】在边缘部署机器学习模型（Deploying machine learning models on the edge），Yan Zhang (Microsoft), Mathew Salvaris (Microsoft)

【O'Reilly AI Conference 2019】在边缘部署机器学习模型（Deploying machine learning models on the edge），Yan Zhang (Microsoft), Mathew Salvaris (Microsoft)

专知会员服务

19+阅读 · 2019年11月5日

谷歌将AutoML应用于Transformer架构，翻译结果飙升，已开源！

谷歌将AutoML应用于Transformer架构，翻译结果飙升，已开源！

数据派THU

5+阅读 · 2019年6月21日

史上最小！纳米级无人机仅重27克，CNN自主导航，已开源！

史上最小！纳米级无人机仅重27克，CNN自主导航，已开源！

全球人工智能

8+阅读 · 2019年5月29日

【泡泡图灵智库】GCNv2：高效关联预测实时SLAM（arXiv）

【泡泡图灵智库】GCNv2：高效关联预测实时SLAM（arXiv）

泡泡机器人SLAM

45+阅读 · 2019年4月15日

【泡泡图灵智库】RTAB-Map : 一个大规模且长期在线的激光与视觉SLAM开源库

【泡泡图灵智库】RTAB-Map : 一个大规模且长期在线的激光与视觉SLAM开源库

泡泡机器人SLAM

34+阅读 · 2018年12月25日

Java开发者必看！机器学习开发库精选

Java开发者必看！机器学习开发库精选

云栖社区

5+阅读 · 2018年8月22日

陈天奇团队推出开源AI芯片栈VTA，降低芯片设计门槛

陈天奇团队推出开源AI芯片栈VTA，降低芯片设计门槛

AI前线

15+阅读 · 2018年7月13日

6月5日凌晨开幕！扒一扒苹果WWDC的秘密战略

6月5日凌晨开幕！扒一扒苹果WWDC的秘密战略

全球人工智能

3+阅读 · 2018年6月4日

从15000个Python开源项目中精选的Top30，Github平均star为3707，赶紧收藏！

从15000个Python开源项目中精选的Top30，Github平均star为3707，赶紧收藏！

量化投资与机器学习

5+阅读 · 2018年1月16日

2017年四巨头的深度学习框架之战，你支持谁？

2017年四巨头的深度学习框架之战，你支持谁？

全球人工智能

6+阅读 · 2017年12月29日

分布式机器学习平台比较

分布式机器学习平台比较

云栖社区

4+阅读 · 2017年8月13日

Fi-GNN: Modeling Feature Interactions via Graph Neural Networks for CTR Prediction

Arxiv

9+阅读 · 2019年10月12日

CoCoNet: A Collaborative Convolutional Network

CoCoNet: A Collaborative Convolutional Network

Arxiv

6+阅读 · 2019年1月28日

Knowledge-enriched Two-layered Attention Network for Sentiment Analysis

Arxiv

4+阅读 · 2018年6月16日

Graph Convolutional Neural Networks for Web-Scale Recommender Systems

Arxiv

14+阅读 · 2018年6月6日

Sentiment Analysis of Arabic Tweets: Feature Engineering and A Hybrid Approach

Arxiv

7+阅读 · 2018年5月22日

Quantum generative adversarial networks

Arxiv

4+阅读 · 2018年4月30日

Adaptive Correlation Filters with Long-Term and Short-Term Memory for Object Tracking

Arxiv

11+阅读 · 2018年3月23日

Beyond Patient Monitoring: Conversational Agents Role in Telemedicine & Healthcare Support For Home-Living Elderly Individuals

Arxiv

3+阅读 · 2018年3月3日

Arxiv

8+阅读 · 2018年1月25日

Fluorescence Microscopy Image Segmentation Using Convolutional Neural Network With Generative Adversarial Networks

Arxiv

18+阅读 · 2018年1月22日

VIP会员

相关主题

相关VIP内容

如何加速NVIDIA gpu上的训练、推理和ML应用？108页ppt，Accelerating training, inference, and ML applications on NVIDIA GPUs

如何加速NVIDIA gpu上的训练、推理和ML应用？108页ppt，Accelerating training, inference, and ML applications on NVIDIA GPUs

专知会员服务

61+阅读 · 2019年12月29日

【O'Reilly TensorFlow Conference 2019】基于TensorFlow的实时流数据机器学习（Machine learning over real-time streaming data with TensorFlow）

【O'Reilly TensorFlow Conference 2019】基于TensorFlow的实时流数据机器学习（Machine learning over real-time streaming data with TensorFlow）

专知会员服务

28+阅读 · 2019年11月14日

【O'Reilly TensorFlow Conference 2019】TensorFlow，开源和IBM（TensorFlow, open source, and IBM ），IBM | Fred Reiss

【O'Reilly TensorFlow Conference 2019】TensorFlow，开源和IBM（TensorFlow, open source, and IBM ），IBM | Fred Reiss

专知会员服务

11+阅读 · 2019年11月14日

【O'Reilly TensorFlow Conference 2019】TensorFlow社区公告（TensorFlow community announcements），Google TensorFlow产品总监Kemal El Moujahid

【O'Reilly TensorFlow Conference 2019】TensorFlow社区公告（TensorFlow community announcements），Google TensorFlow产品总监Kemal El Moujahid

专知会员服务

6+阅读 · 2019年11月14日

【O'Reilly TensorFlow Conference 2019】TensorFlow.js：用java实现机器学习（TensorFlow.js: Bringing machine learning to JavaScript），Google | Sandeep Gupta，蒙特利尔大学 | Joseph Paul Cohen

【O'Reilly TensorFlow Conference 2019】TensorFlow.js：用java实现机器学习（TensorFlow.js: Bringing machine learning to JavaScript），Google | Sandeep Gupta，蒙特利尔大学 | Joseph Paul Cohen

专知会员服务

7+阅读 · 2019年11月14日

【O'Reilly TensorFlow Conference 2019】恶意软件检测（Generative malware outbreak detection），Sean Park | Trend Micro

【O'Reilly TensorFlow Conference 2019】恶意软件检测（Generative malware outbreak detection），Sean Park | Trend Micro

专知会员服务

15+阅读 · 2019年11月13日

【O'Reilly TensorFlow Conference 2019】使用TensorFlow服务的高级模型部署（Advanced model deployments with TensorFlow Serving），谷歌开发专家Hannes Hapke

【O'Reilly TensorFlow Conference 2019】使用TensorFlow服务的高级模型部署（Advanced model deployments with TensorFlow Serving），谷歌开发专家Hannes Hapke

专知会员服务

23+阅读 · 2019年11月13日

【Amazon AWS】深度学习编译器（Deep Learning Compiler），附35页ppt

【Amazon AWS】深度学习编译器（Deep Learning Compiler），附35页ppt

专知会员服务

43+阅读 · 2019年11月5日

【O'Reilly AI Conference 2019】大规模构建和部署AI应用程序和系统（Building and deploying AI applications and systems at scale），O'Reilly的首席数据科学家Ben Lorica、Computable 联合创始人兼首席执行官Roger Chen

【O'Reilly AI Conference 2019】大规模构建和部署AI应用程序和系统（Building and deploying AI applications and systems at scale），O'Reilly的首席数据科学家Ben Lorica、Computable 联合创始人兼首席执行官Roger Chen

专知会员服务

25+阅读 · 2019年11月5日

【O'Reilly AI Conference 2019】在边缘部署机器学习模型（Deploying machine learning models on the edge），Yan Zhang (Microsoft), Mathew Salvaris (Microsoft)

【O'Reilly AI Conference 2019】在边缘部署机器学习模型（Deploying machine learning models on the edge），Yan Zhang (Microsoft), Mathew Salvaris (Microsoft)

专知会员服务

19+阅读 · 2019年11月5日

热门VIP内容

开通专知VIP会员享更多权益服务

【ACL2025教程】大语言模型的护栏与安全性：对其应用的安全、可靠与可控引导

《实现协同自主：从人机协作到多智能体系统》最新190页

【ICML2025】SToFM：一种用于空间转录组学的多尺度基础模型

通信网络智能体白皮书V1.0，61页pdf

相关资讯

谷歌将AutoML应用于Transformer架构，翻译结果飙升，已开源！

谷歌将AutoML应用于Transformer架构，翻译结果飙升，已开源！

数据派THU

5+阅读 · 2019年6月21日

史上最小！纳米级无人机仅重27克，CNN自主导航，已开源！

史上最小！纳米级无人机仅重27克，CNN自主导航，已开源！

全球人工智能

8+阅读 · 2019年5月29日

【泡泡图灵智库】GCNv2：高效关联预测实时SLAM（arXiv）

【泡泡图灵智库】GCNv2：高效关联预测实时SLAM（arXiv）

泡泡机器人SLAM

45+阅读 · 2019年4月15日

【泡泡图灵智库】RTAB-Map : 一个大规模且长期在线的激光与视觉SLAM开源库

【泡泡图灵智库】RTAB-Map : 一个大规模且长期在线的激光与视觉SLAM开源库

泡泡机器人SLAM

34+阅读 · 2018年12月25日

Java开发者必看！机器学习开发库精选

Java开发者必看！机器学习开发库精选

云栖社区

5+阅读 · 2018年8月22日

陈天奇团队推出开源AI芯片栈VTA，降低芯片设计门槛

陈天奇团队推出开源AI芯片栈VTA，降低芯片设计门槛

AI前线

15+阅读 · 2018年7月13日

6月5日凌晨开幕！扒一扒苹果WWDC的秘密战略

6月5日凌晨开幕！扒一扒苹果WWDC的秘密战略

全球人工智能

3+阅读 · 2018年6月4日

从15000个Python开源项目中精选的Top30，Github平均star为3707，赶紧收藏！

从15000个Python开源项目中精选的Top30，Github平均star为3707，赶紧收藏！

量化投资与机器学习

5+阅读 · 2018年1月16日

2017年四巨头的深度学习框架之战，你支持谁？

2017年四巨头的深度学习框架之战，你支持谁？

全球人工智能

6+阅读 · 2017年12月29日

分布式机器学习平台比较

分布式机器学习平台比较

云栖社区

4+阅读 · 2017年8月13日

相关论文

Fi-GNN: Modeling Feature Interactions via Graph Neural Networks for CTR Prediction

Arxiv

9+阅读 · 2019年10月12日

CoCoNet: A Collaborative Convolutional Network

CoCoNet: A Collaborative Convolutional Network

Arxiv

6+阅读 · 2019年1月28日

Knowledge-enriched Two-layered Attention Network for Sentiment Analysis

Arxiv

4+阅读 · 2018年6月16日

Graph Convolutional Neural Networks for Web-Scale Recommender Systems

Arxiv

14+阅读 · 2018年6月6日

Sentiment Analysis of Arabic Tweets: Feature Engineering and A Hybrid Approach

Arxiv

7+阅读 · 2018年5月22日

Quantum generative adversarial networks

Arxiv

4+阅读 · 2018年4月30日

Adaptive Correlation Filters with Long-Term and Short-Term Memory for Object Tracking

Arxiv

11+阅读 · 2018年3月23日

Beyond Patient Monitoring: Conversational Agents Role in Telemedicine & Healthcare Support For Home-Living Elderly Individuals

Arxiv

3+阅读 · 2018年3月3日

Arxiv

8+阅读 · 2018年1月25日

Fluorescence Microscopy Image Segmentation Using Convolutional Neural Network With Generative Adversarial Networks

Arxiv

18+阅读 · 2018年1月22日

微信扫码咨询专知VIP会员