On the accuracy and performance of the lattice Boltzmann method with 64-bit, 32-bit and novel 16-bit number formats

January 31, 2022


Moritz Lehmann, Mathias J. Krause, Giorgio Amati, Marcello Sega, Jens Harting, Stephan Gekle

From arXiv: 30 pages, 20 figures, 4 tables, 2 code listings

Fluid dynamics simulations with the lattice Boltzmann method (LBM) are very memory-intensive. Alongside reduction in memory footprint, significant performance benefits can be achieved by using FP32 (single) precision compared to FP64 (double) precision, especially on GPUs. Here, we evaluate the possibility of using even FP16 and Posit16 (half) precision for storing fluid populations, while still carrying out arithmetic operations in FP32. For this, we first show that the commonly occurring number range in the LBM is much smaller than the FP16 number range. Based on this observation, we develop novel 16-bit formats, based on a modified IEEE-754 standard and on a modified Posit standard, that are specifically tailored to the needs of the LBM. We then carry out an in-depth characterization of LBM accuracy for six different test systems of increasing complexity: Poiseuille flow, Taylor-Green vortices, Kármán vortex streets, lid-driven cavity, a microcapsule in shear flow (utilizing the immersed-boundary method) and finally the impact of a raindrop (based on a Volume-of-Fluid approach). We find that the difference in accuracy between FP64 and FP32 is negligible in almost all cases, and that for a large number of cases even 16-bit is sufficient. Finally, we provide a detailed performance analysis of all precision levels on a large number of hardware microarchitectures and show that significant speedup is achieved with mixed FP32/16-bit.
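The storage idea in the abstract (keep populations in 16 bits, convert to a wider type for arithmetic) can be illustrated with a minimal sketch. It is not the authors' format: the abstract does not describe the bit layout of the modified IEEE-754 or Posit variants, so the code below only uses standard IEEE-754 binary16 plus a power-of-two scale factor as one simple way to move a small number range into the part of FP16 with full mantissa precision. The names `SCALE`, `store`, `load` and `relative_error`, as well as the value `2**15`, are assumptions made for illustration, and Python's own floats are 64-bit rather than FP32.

```python
import struct

# Hypothetical power-of-two scale; the abstract does not state which value,
# if any, the authors use. A power of two changes only the exponent bits.
SCALE = 2.0 ** 15


def store(value: float, scale: float = 1.0) -> int:
    """Scale a value, round it to IEEE-754 binary16, and return the 16-bit pattern.

    struct raises OverflowError if value * scale exceeds the binary16 range.
    """
    return struct.unpack("<H", struct.pack("<e", value * scale))[0]


def load(bits: int, scale: float = 1.0) -> float:
    """Decode a 16-bit pattern to a wider float and undo the scaling."""
    return struct.unpack("<e", struct.pack("<H", bits))[0] / scale


def relative_error(value: float, scale: float = 1.0) -> float:
    """Relative round-trip error of storing value in 16 bits with the given scale."""
    return abs(load(store(value, scale), scale) - value) / abs(value)


# A small population-like value that falls into the subnormal range of plain FP16.
x = 1.0e-6
print("plain FP16 relative error :", relative_error(x))
print("scaled FP16 relative error:", relative_error(x, SCALE))
```

In this sketch the scaled variant keeps the value in the normal range of binary16, so the round-trip error stays within half a unit in the last place, while the unscaled value loses mantissa bits as a subnormal. The cost is a smaller maximum storable magnitude, which is only acceptable if the stored numbers really occupy a narrow range, as the abstract reports for the LBM.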

