Image recognition models that can work in challenging environments (e.g., extremely dark, blurry, or high dynamic range conditions) must be useful. However, creating a training dataset for such environments is expensive and hard due to the difficulties of data collection and annotation. It is desirable if we could get a robust model without the need of hard-to-obtain dataset. One simple approach is to apply data augmentation such as color jitter and blur to standard RGB (sRGB) images in simple scenes. Unfortunately, this approach struggles to yield realistic images in terms of pixel intensity and noise distribution due to not considering the non-linearity of Image Signal Processor (ISP) and noise characteristics of an image sensor. Instead, we propose a noise-accounted RAW image augmentation method. In essence, color jitter and blur augmentation are applied to a RAW image before applying non-linear ISP, yielding realistic intensity. Furthermore, we introduce a noise amount alignment method that calibrates the domain gap in noise property caused by the augmentation. We show that our proposed noise-accounted RAW augmentation method doubles the image recognition accuracy in challenging environments only with simple training data.
翻译:在具有挑战性的环境中(例如极端黑暗、模糊或高度动态范围条件下),图像识别模型必须有用。然而,由于数据收集和注释方面的困难,为此类环境创建培训数据集的费用昂贵和难度很大。如果我们可以获得一个强大的模型,而不需要难以获取的数据集,这是可取的。一个简单的办法是在简单场景中应用诸如颜色抖动和模糊到标准 RGB (sRGB) 图像等数据增强功能。不幸的是,由于不考虑图像信号处理器(ISP)的非线性以及图像传感器的噪音特性,这一方法在像素强度和噪音分布方面很难产生现实的图像。相反,我们建议采用一个噪音计成法,在应用非线性 ISP 之前,对RAW 图像应用彩色振动和模糊放大方法,产生现实的强度。此外,我们引入了一种噪音量调整方法,以校正扩增造成的噪音特性的域间差距。我们提议的 RISW 扩增方法使挑战性环境中的图像精确度翻倍。