Sketching is a stochastic dimension reduction method that preserves geometric structures of data and has applications in high-dimensional regression, low rank approximation and graph sparsification. In this work, we show that sketching can be used to compress simulation data and still accurately estimate time autocorrelation and power spectral density. For a given compression ratio, the accuracy is much higher than using previously known methods. In addition to providing theoretical guarantees, we apply sketching to a molecular dynamics simulation of methanol and find that the estimate of spectral density is 90% accurate using only 10% of the data.
翻译:切除是一种保存数据几何结构并应用于高维回归、低级近似和图形放大的切片尺寸减少方法。 在这项工作中,我们显示,草图可以用来压缩模拟数据,并且仍然准确地估计时间的自动关系和光谱密度。对于特定的压缩比率,精确度比使用先前已知方法要高得多。除了提供理论保证外,我们还将草图应用于甲醇的分子动态模拟,发现光谱密度估计值为90%,只使用10%的数据。