Lossy compression plays a growing role in scientific simulations, where the output data to be stored can span terabytes. Error-bounded lossy compression reduces the storage required for each simulation; however, there is no known upper bound on lossy compressibility. Correlation structures in the data, the choice of compressor, and the error bound are factors that enable larger compression ratios and improved quality metrics. Analyzing these three factors offers one direction towards quantifying lossy compressibility. As a first step, we explore statistical methods to characterize the correlation structures present in the data and to relate them, through functional models, to compression ratios. We observed a relationship between compression ratios and statistics summarizing the correlation structure of the data, a first step towards evaluating the theoretical limits of lossy compressibility, with the eventual goal of predicting compression performance and adapting compressors to the correlation structures present in the data.
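A minimal sketch of the kind of analysis described above, not the authors' method: it summarizes the correlation structure of a 2-D field with a lag-1 autocorrelation statistic and fits a simple functional model relating that statistic to compression ratios. The field generator, the statistic, and the compression ratios below are illustrative assumptions; in practice the ratios would come from an error-bounded compressor such as SZ or ZFP at a fixed error bound.

```python
import numpy as np

def lag1_autocorrelation(field: np.ndarray) -> float:
    """Average lag-1 autocorrelation along both axes of a 2-D field."""
    x = field - field.mean()
    denom = (x * x).sum()
    rho_rows = (x[:, :-1] * x[:, 1:]).sum() / denom
    rho_cols = (x[:-1, :] * x[1:, :]).sum() / denom
    return float(0.5 * (rho_rows + rho_cols))

def smooth_field(n: int, passes: int, rng: np.random.Generator) -> np.ndarray:
    """Synthetic field whose correlation length grows with `passes`."""
    f = rng.standard_normal((n, n))
    for _ in range(passes):
        f = 0.25 * (np.roll(f, 1, 0) + np.roll(f, -1, 0)
                    + np.roll(f, 1, 1) + np.roll(f, -1, 1))
    return f

rng = np.random.default_rng(0)
fields = [smooth_field(256, p, rng) for p in (0, 2, 8, 32)]
rho = np.array([lag1_autocorrelation(f) for f in fields])

# Hypothetical compression ratios for the four fields (assumed values,
# standing in for measurements from an error-bounded lossy compressor).
ratios = np.array([2.1, 4.7, 9.3, 18.5])

# Functional model: log(compression ratio) as a linear function of the
# correlation statistic, fit by least squares.
slope, intercept = np.polyfit(rho, np.log(ratios), deg=1)
print("model: log(ratio) ~ %.3f * rho + %.3f" % (slope, intercept))
```

The choice of statistic (lag-1 autocorrelation) and model form (log-linear) is only one possibility; the abstract's point is that such summary statistics carry information about achievable compression ratios.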