OpDiLib 的 OpenMP 事件自动区分 (Event-Based Automatic Differentiation of OpenMP with OpDiLib) - 专知论文

会员服务 ·

0

Atom（文本编辑器） · TOOLS · 代码 · Performer · 操作 ·

2022 年 6 月 30 日

Event-Based Automatic Differentiation of OpenMP with OpDiLib

翻译：OpDiLib 的 OpenMP 事件自动区分

Johannes Blühdorn,Max Sagebaum,Nicolas R. Gauger

from arxiv, 31 pages, 13 figures, 3 tables, 13 listings; new layout, additional references, refocused Section 3 (former Section 4), extended performance tests, overall polishing and shortening

We present the new software OpDiLib, a universal add-on for classical operator overloading AD tools that enables the automatic differentiation (AD) of OpenMP parallelized code. With it, we establish support for OpenMP features in a reverse mode operator overloading AD tool to an extent that was previously only reported on in source transformation tools. We achieve this with an event-based implementation ansatz that is unprecedented in AD. Combined with modern OpenMP features around OMPT, we demonstrate how it can be used to achieve differentiation without any additional modifications of the source code; neither do we impose a priori restrictions on the data access patterns, which makes OpDiLib highly applicable. For further performance optimizations, restrictions like atomic updates on adjoint variables can be lifted in a fine-grained manner. OpDiLib can also be applied in a semi-automatic fashion via a macro interface, which supports compilers that do not implement OMPT. We demonstrate the applicability of OpDiLib for a pure operator overloading approach in a hybrid parallel environment. We quantify the cost of atomic updates on adjoint variables and showcase the speedup and scaling that can be achieved with the different configurations of OpDiLib in both the forward and the reverse pass.

翻译：我们展示了新的软件OpdiLib, 这是一种通用的软件 OpdiLib, 用于经典操作员超载的AD 工具, 使得 OpenMP 平行代码的自动区分( AD) 。有了它, 我们就可以在反向模式操作员超载的 AD 工具中建立对 OpenMP 功能的支持, 其程度以前只在源转换工具中报告过。我们用一个在 AD 上没有先例的基于事件的执行 ansatz 实现这一点。加上在 OMPT 周围的现代 OpenMP 功能, 我们展示了如何在不进一步修改源代码的情况下使用 OpdiLib 来实现差异化; 我们也没有对数据访问模式施加先验限制, 从而使得 OpdiLib 高度适用。对于进一步的性能优化, 可以以细微的精细度的方式取消对自动更新对自动变量的限制。 OpdiLb 还可以通过一个宏观接口, 半自动应用, 支持不执行 OMOPT。我们证明 OpdiL 可用于在混合平行环境中的纯过往配置。我们量化原子更新的同步更新成本成本的成本, 。

0

相关内容

Atom（文本编辑器）

Atom（文本编辑器）

GitHub 发布的文本编辑器。

【干货书】机器学习设计模式，408页pdf，Machine Learning Design Patterns

【干货书】机器学习设计模式，408页pdf，Machine Learning Design Patterns

专知会员服务

138+阅读 · 2022年2月6日

NLP必读经典文献100篇

专知会员服务

124+阅读 · 2020年9月8日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

167+阅读 · 2020年3月18日

【新书：机器学习简介】《A Concise Introduction to Machine Learning》by A.C. Faul (CRC 2019)

【新书：机器学习简介】《A Concise Introduction to Machine Learning》by A.C. Faul (CRC 2019)

专知会员服务

77+阅读 · 2020年2月8日

UC.Berkeley CS189讲义教材:《机器学习全面指南》，185页pdf

专知会员服务

162+阅读 · 2020年1月16日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

163+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

IEEE TII Call For Papers

IEEE TII Call For Papers

CCF多媒体专委会

3+阅读 · 2022年3月24日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

中国图象图形学学会CSIG

0+阅读 · 2021年11月10日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

中国图象图形学学会CSIG

0+阅读 · 2021年11月8日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

中国图象图形学学会CSIG

0+阅读 · 2021年11月3日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

家蚕丝素酶fibroinase的结构解析及其活性周期性变化的分子机制

国家自然科学基金

0+阅读 · 2015年12月31日

染色质重塑因子（ARID1A）突变在膀胱移行细胞癌发生中的作用机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

Faecalibacterium prausnitzii协同LFA-1在炎症性肠病发生中调控淋巴细胞分化及功能的作用机制

国家自然科学基金

0+阅读 · 2014年12月31日

食管癌中NTRK3基因重排的鉴定及其作用机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

有限长区域中的空间耦合多元Rateless码研究

国家自然科学基金

0+阅读 · 2012年12月31日

《计算机研究与发展》学术期刊

国家自然科学基金

1+阅读 · 2011年12月31日

hTERT调控相关miRNA的鉴定及功能研究

国家自然科学基金

0+阅读 · 2009年12月31日

可压Navier-Stokes方程及相关流体动力学方程研究

国家自然科学基金

0+阅读 · 2008年12月31日

TR3相互作用新蛋白机理研究

国家自然科学基金

1+阅读 · 2008年12月31日

几何动力学在非完整系统几何数值积分中的应用研究

国家自然科学基金

0+阅读 · 2008年12月31日

Optimal Client Sampling for Federated Learning

Optimal Client Sampling for Federated Learning

Arxiv

0+阅读 · 2022年8月22日

Event-Triggered Model Predictive Control with Deep Reinforcement Learning for Autonomous Driving

Arxiv

0+阅读 · 2022年8月22日

Event-Based Beam Tracking with Dynamic Beamwidth Adaptation in Terahertz (THz) Communications

Arxiv

0+阅读 · 2022年8月21日

A Novel NOMA Solution with RIS Partitioning

Arxiv

0+阅读 · 2022年8月20日

Federated Select: A Primitive for Communication- and Memory-Efficient Federated Learning

Arxiv

0+阅读 · 2022年8月19日

Exploration, Path Planning with Obstacle and Collision Avoidance in a Dynamic Environment

Arxiv

0+阅读 · 2022年8月19日

Physics-Informed Neural Network Method for Parabolic Differential Equations with Sharply Perturbed Initial Conditions

Arxiv

0+阅读 · 2022年8月18日

On Neural Differential Equations

Arxiv

23+阅读 · 2022年2月4日

Multi-Object Tracking with Deep Learning Ensemble for Unmanned Aerial System Applications

Arxiv

26+阅读 · 2021年10月5日

OntoZSL: Ontology-enhanced Zero-shot Learning

Arxiv

17+阅读 · 2021年2月15日

VIP会员

文章信息

相关主题

Atom（文本编辑器）

相关VIP内容

【干货书】机器学习设计模式，408页pdf，Machine Learning Design Patterns

【干货书】机器学习设计模式，408页pdf，Machine Learning Design Patterns

专知会员服务

138+阅读 · 2022年2月6日

NLP必读经典文献100篇

专知会员服务

124+阅读 · 2020年9月8日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

167+阅读 · 2020年3月18日

【新书：机器学习简介】《A Concise Introduction to Machine Learning》by A.C. Faul (CRC 2019)

【新书：机器学习简介】《A Concise Introduction to Machine Learning》by A.C. Faul (CRC 2019)

专知会员服务

77+阅读 · 2020年2月8日

UC.Berkeley CS189讲义教材:《机器学习全面指南》，185页pdf

专知会员服务

162+阅读 · 2020年1月16日

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

Auto-Sizing the Transformer Network: Improving Speed, Efficiency, and Performance for Low-Resource Machine Translation

专知会员服务

50+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

163+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

热门VIP内容

开通专知VIP会员享更多权益服务

【CMU博士论文】基础模型训练中网络规模数据的负责任与高效使用

《俄乌战争背景下俄罗斯的战略性海军分析（2022-2025年）》最新100页报告

人工智能时代背景下的未来海战

相关资讯

VCIP 2022 Call for Special Session Proposals

VCIP 2022 Call for Special Session Proposals

CCF多媒体专委会

1+阅读 · 2022年4月1日

IEEE TII Call For Papers

IEEE TII Call For Papers

CCF多媒体专委会

3+阅读 · 2022年3月24日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium4

中国图象图形学学会CSIG

0+阅读 · 2021年11月10日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium2

中国图象图形学学会CSIG

0+阅读 · 2021年11月8日

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

【ICIG2021】Check out the hot new trailer of ICIG2021 Symposium1

中国图象图形学学会CSIG

0+阅读 · 2021年11月3日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

无监督元学习表示学习

无监督元学习表示学习

CreateAMind

27+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

相关论文

Optimal Client Sampling for Federated Learning

Optimal Client Sampling for Federated Learning

Arxiv

0+阅读 · 2022年8月22日

Event-Triggered Model Predictive Control with Deep Reinforcement Learning for Autonomous Driving

Arxiv

0+阅读 · 2022年8月22日

Event-Based Beam Tracking with Dynamic Beamwidth Adaptation in Terahertz (THz) Communications

Arxiv

0+阅读 · 2022年8月21日

A Novel NOMA Solution with RIS Partitioning

Arxiv

0+阅读 · 2022年8月20日

Federated Select: A Primitive for Communication- and Memory-Efficient Federated Learning

Arxiv

0+阅读 · 2022年8月19日

Exploration, Path Planning with Obstacle and Collision Avoidance in a Dynamic Environment

Arxiv

0+阅读 · 2022年8月19日

Physics-Informed Neural Network Method for Parabolic Differential Equations with Sharply Perturbed Initial Conditions

Arxiv

0+阅读 · 2022年8月18日

On Neural Differential Equations

Arxiv

23+阅读 · 2022年2月4日

Multi-Object Tracking with Deep Learning Ensemble for Unmanned Aerial System Applications

Arxiv

26+阅读 · 2021年10月5日

OntoZSL: Ontology-enhanced Zero-shot Learning

Arxiv

17+阅读 · 2021年2月15日

相关基金

家蚕丝素酶fibroinase的结构解析及其活性周期性变化的分子机制

国家自然科学基金

0+阅读 · 2015年12月31日

染色质重塑因子（ARID1A）突变在膀胱移行细胞癌发生中的作用机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

Faecalibacterium prausnitzii协同LFA-1在炎症性肠病发生中调控淋巴细胞分化及功能的作用机制

国家自然科学基金

0+阅读 · 2014年12月31日

食管癌中NTRK3基因重排的鉴定及其作用机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

有限长区域中的空间耦合多元Rateless码研究

国家自然科学基金

0+阅读 · 2012年12月31日

《计算机研究与发展》学术期刊

国家自然科学基金

1+阅读 · 2011年12月31日

hTERT调控相关miRNA的鉴定及功能研究

国家自然科学基金

0+阅读 · 2009年12月31日

可压Navier-Stokes方程及相关流体动力学方程研究

国家自然科学基金

0+阅读 · 2008年12月31日

TR3相互作用新蛋白机理研究

国家自然科学基金

1+阅读 · 2008年12月31日

几何动力学在非完整系统几何数值积分中的应用研究

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员