Today's scientific simulations require a significant reduction of data volume because of extremely large amounts of data they produce and the limited I/O bandwidth and storage space. Error-bounded lossy compression has been considered one of the most effective solutions to the above problem. However, little work has been done to improve error-bounded lossy compression for Adaptive Mesh Refinement (AMR) simulation data. Unlike the previous work that only leverages 1D compression, in this work, we propose an approach (TAC) to leverage high-dimensional SZ compression for each refinement level of AMR data. To remove the data redundancy across different levels, we propose several pre-process strategies and adaptively use them based on the data characteristics. We further optimize TAC to TAC+ by improving the lossless encoding stage of SZ compression to efficiently handle many small AMR data blocks after the pre-processing. Experiments on 8 AMR datasets from a real-world large-scale AMR simulation demonstrate that TAC+ can improve the compression ratio by up to 4.9X under the same data distortion, compared to the state-of-the-art method. In addition, we leverage the flexibility of our approach to tune the error bound for each level, which achieves much lower data distortion on two application-specific metrics.
翻译:今天的科学模拟需要大量减少数据量,因为其产生的数据数量巨大,而且I/O带宽和储存空间有限。错误造成的损失压缩被认为是解决上述问题的最有效办法之一。然而,在改进适应性网状精炼(AMR)模拟数据方面,没有做多少改进错误引起的损失压缩工作。与以前只利用1D压缩的工作不同,在这项工作中,我们提议采用一种方法(TAC)来利用高维SZ压缩来填补AMR数据的每个精细度。为了消除不同层次的数据冗余,我们提出若干预处理战略,并根据数据特点适应性地使用这些战略。我们进一步优化了SZ压缩到TAC+的无损编码阶段,以便在预处理后有效地处理许多小型AMR数据块。在实际大规模AMR模拟中,对8个AMR数据集的实验表明,TAC+可以在同一数据扭曲下将压缩率提高到4.9X,与州一级数据扭曲法相比,我们提出了多项预处理方法的适应性使用。此外,我们利用了两种调整式的标准方法的弹性。