Python has become the de facto language for scientific computing. Programming in Python is highly productive, mainly due to its rich science-oriented software ecosystem built around the NumPy module. As a result, the demand for Python support in High Performance Computing (HPC) has skyrocketed. However, the Python language itself does not necessarily offer high performance. In this work, we present a workflow that retains Python's high productivity while achieving portable performance across different architectures. The workflow's key features are HPC-oriented language extensions and a set of automatic optimizations powered by a data-centric intermediate representation. We show performance results and scaling across CPU, GPU, FPGA, and the Piz Daint supercomputer (up to 23,328 cores), with 2.47x and 3.75x speedups over previous-best solutions, first-ever Xilinx and Intel FPGA results of annotated Python, and up to 93.16% scaling efficiency on 512 nodes.
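To make the described workflow concrete, the following minimal sketch shows what "annotated Python" driven by a data-centric intermediate representation can look like. It assumes the DaCe framework's Python frontend (the @dace.program decorator and symbolic sizes), which matches the approach described here; the function name matmul_add and the symbol N are illustrative only, not taken from this work.

    # Minimal sketch, assuming the DaCe Python frontend; names are illustrative.
    import numpy as np
    import dace

    N = dace.symbol('N')  # symbolic size, resolved when the program is called

    @dace.program  # annotation: the function is parsed into a data-centric IR
    def matmul_add(A: dace.float64[N, N], B: dace.float64[N, N],
                   C: dace.float64[N, N]):
        # NumPy-style operations are lowered to dataflow and optimized
        # automatically for the target architecture (CPU, GPU, or FPGA).
        C[:] = A @ B + C

    if __name__ == '__main__':
        a = np.random.rand(128, 128)
        b = np.random.rand(128, 128)
        c = np.zeros((128, 128))
        matmul_add(a, b, c)  # JIT-compiles and runs the optimized program

The point of the sketch is that the source remains ordinary NumPy code; the annotations only expose types and symbolic shapes so that the optimizations can be applied without rewriting the algorithm per architecture.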