多层事件相机降噪基准数据集E-MLB (E-MLB: Multilevel Benchmark for Event-Based Camera Denoising)

Event cameras, such as dynamic vision sensors (DVS), are biologically inspired vision sensors that have advanced over conventional cameras in high dynamic range, low latency and low power consumption, showing great application potential in many fields. Event cameras are more sensitive to junction leakage current and photocurrent as they output differential signals, losing the smoothing function of the integral imaging process in the RGB camera. The logarithmic conversion further amplifies noise, especially in low-contrast conditions. Recently, researchers proposed a series of datasets and evaluation metrics but limitations remain: 1) the existing datasets are small in scale and insufficient in noise diversity, which cannot reflect the authentic working environments of event cameras; and 2) the existing denoising evaluation metrics are mostly referenced evaluation metrics, relying on APS information or manual annotation. To address the above issues, we construct a large-scale event denoising dataset (multilevel benchmark for event denoising, E-MLB) for the first time, which consists of 100 scenes, each with four noise levels, that is 12 times larger than the largest existing denoising dataset. We also propose the first nonreference event denoising metric, the event structural ratio (ESR), which measures the structural intensity of given events. ESR is inspired by the contrast metric, but is independent of the number of events and projection direction. Based on the proposed benchmark and ESR, we evaluate the most representative denoising algorithms, including classic and SOTA, and provide denoising baselines under various scenes and noise levels. The corresponding results and codes are available at https://github.com/KugaMaxx/cuke-emlb.

翻译：事件相机(如动态视觉传感器DVS)是生物学启发的视觉传感器，具有高动态范围、低延迟和低功耗等优点，在许多领域有着巨大的应用潜力。事件相机的输出信号是差分信号，失去了RGB相机中积分成像过程的平滑功能，因此更加敏感于结合泄漏电流和光电流的影响。对数转换进一步放大了噪声，尤其是在低对比度条件下。最近，研究人员提出了一系列数据集和评估指标，但仍存在一些限制：1）现有的数据集规模较小，在噪声多样性方面不足，不能反映事件相机的实际工作环境；2）现有的降噪评估指标大多是参考评估指标，依赖于APS信息或人工标注。为了解决上述问题，我们首次构建了一个大规模的事件降噪数据集(E-MLB) ，其中包含100个场景，每个场景有四个噪声级别，比现有的最大的降噪数据集大12倍。我们还提出了第一个非参考事件降噪指标事件结构比(ESR)，它测量给定事件的结构强度。ESR受对比度度量启发，但与事件数量和投影方向无关。基于提出的基准和ESR，我们评估了最具代表性的降噪算法，包括经典算法和SOTA算法，并在不同的场景和噪声级别下提供降噪基线。相应的结果和代码可在https://github.com/KugaMaxx/cuke-emlb上找到。