Challenges Porting a C++ Template-Metaprogramming Abstraction Layer to Directive-based Offloading - Zhuanzhi Papers

October 16, 2021

Challenges Porting a C++ Template-Metaprogramming Abstraction Layer to Directive-based Offloading

Jeffrey Kelling, Sergei Bastrakov, Alexander Debus, Thomas Kluge, Matt Leinhauser, Richard Pausch, Klaus Steiniger, Jan Stephan, René Widera, Jeff Young, Michael Bussmann, Sunita Chandrasekaran, Guido Juckeland

From arXiv, 20 pages, 1 figure, 3 tables, WACCPD@SC21

HPC systems employ a growing variety of compute accelerators with different architectures and from different vendors. Large scientific applications are required to run efficiently across these systems but need to retain a single code-base in order to not stifle development. Directive-based offloading programming models set out to provide the required portability, but, to existing codes, they themselves represent yet another API to port to. Here, we present our approach of porting the GPU-accelerated particle-in-cell code PIConGPU to OpenACC and OpenMP target by adding two new backends to its existing C++-template metaprogramming-based offloading abstraction layer alpaka and avoiding other modifications to the application code. We introduce our approach in the face of conflicts between requirements and available features in the standards as well as practical hurdles posed by immature compiler support.
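The idea of adding a directive-based backend behind a template abstraction layer, while leaving application kernels untouched, can be illustrated with a minimal sketch. Everything named here (`SerialBackend`, `OmpTargetBackend`, `Launcher`, `ScaleKernel`, `scale_on`) is hypothetical and chosen for illustration; it is not alpaka's actual API and not the authors' implementation. The sketch assumes that a compiler without OpenMP enabled ignores the pragma, so the loop then simply runs serially on the host.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical backend tags (illustrative only, not alpaka's real types).
struct SerialBackend {};
struct OmpTargetBackend {};

// Application kernel: written once as a functor, unaware of any backend.
struct ScaleKernel {
    template <typename T>
    void operator()(std::size_t i, T* data, T factor) const {
        data[i] *= factor;
    }
};

// Primary template: each backend specializes how a kernel is launched.
template <typename Backend>
struct Launcher;

// Plain host loop.
template <>
struct Launcher<SerialBackend> {
    template <typename Kernel, typename T>
    static void run(std::size_t n, Kernel kernel, T* data, T factor) {
        assert(n == 0 || data != nullptr);
        for (std::size_t i = 0; i < n; ++i) kernel(i, data, factor);
    }
};

// Same loop, annotated with an OpenMP target directive.
// Without OpenMP support the pragma is ignored and the loop runs on the host.
template <>
struct Launcher<OmpTargetBackend> {
    template <typename Kernel, typename T>
    static void run(std::size_t n, Kernel kernel, T* data, T factor) {
        assert(n == 0 || data != nullptr);
        #pragma omp target teams distribute parallel for map(tofrom: data[0:n])
        for (std::size_t i = 0; i < n; ++i) kernel(i, data, factor);
    }
};

// Application-side entry point: only the Backend template argument changes.
template <typename Backend>
std::vector<double> scale_on(std::vector<double> values, double factor) {
    Launcher<Backend>::run(values.size(), ScaleKernel{}, values.data(), factor);
    return values;
}
```

Calling `scale_on<SerialBackend>(v, 2.0)` and `scale_on<OmpTargetBackend>(v, 2.0)` should give the same result; the application code differs only in the backend tag, which is the property the porting approach aims to preserve.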
