Error-bounded lossy compression is a critical technique for significantly reducing scientific data volumes. With ever-emerging heterogeneous HPC architectures, GPU-accelerated error-bounded compressors (such as cuSZ and cuZFP) have been developed. However, they suffer from either low performance or low compression ratios. To this end, we propose cuSZ(x), which targets both high compression ratios and high throughput. We identify that data sparsity and data smoothness are key factors for high compression throughput. Our key contributions in this work are fourfold: (1) We propose an efficient compression workflow that adaptively performs run-length encoding and/or variable-length encoding. (2) We formulate the Lorenzo reconstruction in decompression as a multidimensional partial-sum computation and propose a fine-grained Lorenzo reconstruction algorithm for GPU architectures. (3) We carefully optimize each of cuSZ(x)'s kernels by leveraging state-of-the-art CUDA parallel primitives. (4) We evaluate cuSZ(x) using seven real-world HPC application datasets on V100 and A100 GPUs. Experiments show that cuSZ(x) improves the compression throughput and compression ratio by up to 18.4$\times$ and 5.3$\times$, respectively, over cuSZ on the tested datasets.
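To illustrate the partial-sum formulation behind contribution (2), below is a minimal sketch (not cuSZ(x)'s actual kernel; the error bound eb and the quantization codes q are illustrative assumptions) showing why 1-D Lorenzo reconstruction reduces to a prefix sum: the serial recurrence x'[i] = x'[i-1] + 2*eb*q[i] unrolls to x'[i] = 2*eb * sum_{k<=i} q[k], which a GPU scan primitive such as Thrust's inclusive_scan computes in parallel.

```cuda
#include <cstdio>
#include <vector>
#include <thrust/device_vector.h>
#include <thrust/scan.h>
#include <thrust/transform.h>

// Map an integer partial sum back to a reconstructed value: x = 2*eb*sum.
struct DequantScale {
    float twice_eb;
    __host__ __device__ float operator()(int s) const { return twice_eb * s; }
};

int main() {
    const float eb = 0.01f;  // user-specified absolute error bound (assumed)

    // Quantization codes q[i] = round((x[i] - pred[i]) / (2*eb)) as a
    // 1-D Lorenzo compressor would emit them (illustrative values only).
    std::vector<int> h_q = {100, -2, 3, 0, 1, -4};
    thrust::device_vector<int> q(h_q.begin(), h_q.end());
    thrust::device_vector<int> partial(q.size());

    // The loop-carried recurrence x'[i] = x'[i-1] + 2*eb*q[i] becomes a
    // single parallel inclusive scan over the quantization codes.
    thrust::inclusive_scan(q.begin(), q.end(), partial.begin());

    // Scale the integer partial sums back into the data domain.
    thrust::device_vector<float> x(q.size());
    thrust::transform(partial.begin(), partial.end(), x.begin(),
                      DequantScale{2.0f * eb});

    for (size_t i = 0; i < x.size(); ++i)
        printf("x[%zu] = %.4f\n", i, static_cast<float>(x[i]));
    return 0;
}
```

In higher dimensions, the same partial-sum view factorizes into independent 1-D scans along each axis (e.g., row-wise scans followed by column-wise scans in 2-D), which is what makes a fine-grained GPU mapping possible.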