This article presents a weakly supervised machine learning method, which we call DAS-N2N, for suppressing strong random noise in distributed acoustic sensing (DAS) recordings. DAS-N2N requires no manually produced labels (i.e., pre-determined examples of clean event signals or sections of noise) for training and aims to map random noise processes to a chosen summary statistic, such as the distribution mean, median or mode, whilst retaining the true underlying signal. This is achieved by splicing (joining together) two fibres hosted within a single optical cable, recording two noisy copies of the same underlying signal corrupted by different independent realizations of random observational noise. A deep learning model can then be trained using only these two noisy copies of the data to produce a near fully-denoised copy. Once the model is trained, only noisy data from a single fibre is required. Using a dataset from a DAS array deployed on the surface of the Rutford Ice Stream in Antarctica, we demonstrate that DAS-N2N greatly suppresses incoherent noise and enhances the signal-to-noise ratios (SNR) of natural microseismic icequake events. We further show that this approach is inherently more efficient and effective than standard stop/pass band filtering routines and a comparable self-supervised learning method based on masking individual DAS channels. Our preferred model for this task is lightweight, processing 30 seconds of data recorded at a sampling frequency of 1000 Hz over 985 channels (approx. 1 km of fiber) in $<$1 s. Due to the high noise levels in DAS recordings, efficient data-driven denoising methods, such as DAS-N2N, will prove essential to time-critical DAS earthquake detection, particularly in the case of microseismic monitoring.
翻译:本文提出了一种弱监督机器学习方法,我们称之为DAS-N2N,用于抑制分布式声学传感(DAS)记录中的强随机噪声。 DAS-N2N无需手动制作标签(即预先确定的清洁事件信号或噪声部分的示例)进行训练,并旨在将随机噪声过程映射到所选摘要统计量(例如分布平均值,中位数或模式),同时保留真实的基础信号。这通过拼接(组合)单个光缆内托管的两根纤维来实现,记录受不同独立观察噪声实现所污染的相同基础信号的两个噪声副本。然后,可以使用仅这两个噪声数据来训练深度学习模型,以产生近乎完全去噪的副本。一旦训练了模型,只需要来自单个纤维的噪声数据。使用在南极洲路特福德冰川表面部署的DAS阵列的数据集,我们证明DAS-N2N极大地抑制了不相干的噪声,并增强了天然微震冰震事件的信噪比(SNR)。我们进一步显示,此方法固有地比标准的停/通过滤波例程和基于遮蔽单个DAS通道的可比自我监督学习方法更为高效和有效。我们此任务的首选模型是轻量级的,在1 s内处理1000 Hz的采样频率下记录的30秒数据,覆盖985个通道(约1 km的光纤)。由于DAS记录的噪声水平很高,高效的数据驱动去噪方法(例如DAS-N2N)对于时关键的DAS地震检测将证明至关重要,特别是在微震监测的情况下。