Accelerating X-Ray Tracing for Exascale Systems using Kokkos (Zhuanzhi paper listing)

May 16, 2022

Felix Wittwer, Nicholas K. Sauter, Derek Mendez, Billy K. Poon, Aaron S. Brewster, James M. Holton, Michael E. Wall, William E. Hart, Deborah J. Bard, Johannes P. Blaschke

The upcoming exascale computing systems Frontier and Aurora will draw much of their computing power from GPU accelerators. The hardware for these systems will be provided by AMD and Intel, respectively, each supporting their own GPU programming model. The challenge for applications that harness one of these exascale systems will be to avoid lock-in and to preserve performance portability. We report here on our results of using Kokkos to accelerate a real-world application on NERSC's Perlmutter Phase 1 (using NVIDIA A100 accelerators) and the testbed system for OLCF's Frontier (using AMD MI250X). By porting to Kokkos, we were able to successfully run the same X-ray tracing code on both systems and achieved speed-ups between 13% and 66% compared to the original CUDA code. These results are a highly encouraging demonstration of using Kokkos to accelerate production science code.
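The abstract does not say how the speed-up percentages are defined. As a rough sketch only, assuming that a speed-up of s% means the original CUDA runtime divided by the Kokkos runtime equals 1 + s/100 (this definition and the helper name `relative_runtime` are assumptions, not taken from the paper), the reported range can be converted into relative runtimes:

```python
def relative_runtime(speedup_percent: float) -> float:
    """Return the Kokkos runtime as a fraction of the original CUDA runtime.

    Assumption (not from the source): a speed-up of s% means
    t_cuda / t_kokkos = 1 + s / 100.
    """
    return 1.0 / (1.0 + speedup_percent / 100.0)


if __name__ == "__main__":
    # The two endpoints of the range reported in the abstract.
    for s in (13.0, 66.0):
        frac = relative_runtime(s)
        print(f"{s:.0f}% speed-up -> {frac:.3f} of the original runtime")
```

Under that assumed definition, the Kokkos version would take roughly 0.885 to 0.602 of the original runtime. A different definition, such as percentage of runtime saved, would give different figures.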

