Modern deep neural networks (DNNs) achieve highly accurate results for many recognition tasks on overhead (e.g., satellite) imagery. One challenge however is visual domain shifts (i.e., statistical changes), which can cause the accuracy of DNNs to degrade substantially and unpredictably when tested on new sets of imagery. In this work we model domain shifts caused by variations in imaging hardware, lighting, and other conditions as non-linear pixel-wise transformations; and we show that modern DNNs can become largely invariant to these types of transformations, if provided with appropriate training data augmentation. In general, however, we do not know the transformation between two sets of imagery. To overcome this problem, we propose a simple real-time unsupervised training augmentation technique, termed randomized histogram matching (RHM). We conduct experiments with two large public benchmark datasets for building segmentation and find that RHM consistently yields comparable performance to recent state-of-the-art unsupervised domain adaptation approaches despite being simpler and faster. RHM also offers substantially better performance than other comparably simple approaches that are widely-used in overhead imagery.
翻译:现代深心神经网络(DNNS)在对管理费用(例如卫星)图像的许多识别任务中取得了非常准确的结果。然而,一个挑战是视觉领域变化(即统计变化),这可能导致DNNs在新图像测试时大量和无法预测地降解。在这项工作中,我们用成像硬件、照明和其他条件的变化作为非线性像素转换的模型来模拟由成像硬件、照明和其他条件变化引起的领域变化;我们表明,如果提供适当的培训数据增强,现代DNNs可以在很大程度上不适应这些类型的转换。但是,一般来说,我们不知道两套图像之间的转换。为了克服这一问题,我们提出了一种简单的实时、不受监督的培训增强技术,称为随机化的直方图匹配(RHMM)。我们用两个大型公共基准数据集进行实验,以建立分化,发现RMMM公司尽管比较简单和快捷,但始终能取得与最近的、最先进的域域适应方法相类似的业绩。RHMM还提供比其他在高空图像中广泛采用的其他可比较的简单方法要好得多的业绩。