Weather and climate simulations produce petabytes of high-resolution data that are later analyzed by researchers in order to understand climate change or severe weather. We propose a new method of compressing this multidimensional weather and climate data: a coordinate-based neural network is trained to overfit the data, and the resulting parameters are taken as a compact representation of the original grid-based data. While compression ratios range from 300x to more than 3,000x, our method outperforms the state-of-the-art compressor SZ3 in terms of weighted RMSE, MAE. It can faithfully preserve important large scale atmosphere structures and does not introduce artifacts. When using the resulting neural network as a 790x compressed dataloader to train the WeatherBench forecasting model, its RMSE increases by less than 2%. The three orders of magnitude compression democratizes access to high-resolution climate data and enables numerous new research directions.
翻译:天气和气候模拟产生的高分辨率数据量达到百万亿,后来被研究人员分析以理解气候变化或严重天气。我们提出了一种新的方法来压缩这种多维天气和气候数据:利用坐标型神经网络来过度拟合数据,并将得到的参数作为原始基于网格的数据的紧凑表示。尽管压缩比率范围从300倍到超过3,000倍不等,但我们的方法在加权RMSE,MAE方面优于现有的SZ3压缩器。它可以忠实地保留重要的大尺度大气结构,并且不会引入人工构件。当将所得神经网络用作790倍压缩的数据加载器来训练WeatherBench预测模型时,其RMSE仅增加不到2%。三个数量级的压缩使高分辨率气候数据民主化,开启了众多新的研究方向。