Patch-aware Batch Normalization用于提高跨领域鲁棒性的研究 (Patch-aware Batch Normalization for Improving Cross-domain Robustness)

Despite the significant success of deep learning in computer vision tasks, cross-domain tasks still present a challenge in which the model's performance will degrade when the training set and the test set follow different distributions. Most existing methods employ adversarial learning or instance normalization for achieving data augmentation to solve this task. In contrast, considering that the batch normalization (BN) layer may not be robust for unseen domains and there exist the differences between local patches of an image, we propose a novel method called patch-aware batch normalization (PBN). To be specific, we first split feature maps of a batch into non-overlapping patches along the spatial dimension, and then independently normalize each patch to jointly optimize the shared BN parameter at each iteration. By exploiting the differences between local patches of an image, our proposed PBN can effectively enhance the robustness of the model's parameters. Besides, considering the statistics from each patch may be inaccurate due to their smaller size compared to the global feature maps, we incorporate the globally accumulated statistics with the statistics from each batch to obtain the final statistics for normalizing each patch. Since the proposed PBN can replace the typical BN, it can be integrated into most existing state-of-the-art methods. Extensive experiments and analysis demonstrate the effectiveness of our PBN in multiple computer vision tasks, including classification, object detection, instance retrieval, and semantic segmentation.

翻译：尽管深度学习在计算机视觉任务中取得了巨大成功，但跨领域任务仍然是一个挑战，在这种任务中，当训练集和测试集遵循不同的分布时，模型的性能将会下降。大多数现有方法采用对抗学习或实例归一化来实现数据增强以解决这个任务。相反，考虑到批处理归一化（BN）层可能对未见领域不具有弹性，并且图像中的局部补丁之间存在差异，我们提出了一种称为patch-aware batch normalization（PBN）的新方法。具体而言，我们首先沿着空间维度将批次的特征图分裂为非重叠的补丁，然后独立地对每个补丁进行标准化，以联合优化每次迭代的共享BN参数。通过利用图像局部补丁之间的差异，我们提出的PBN可以有效地增强模型参数的鲁棒性。此外，考虑到由于其与全局特征图相比较小，因此每个补丁的统计数据可能不准确，因此我们将全局累积统计数据与每个批次的统计数据结合起来以获得标准化每个补丁的最终统计数据。由于提出的PBN可以替换典型的BN，因此它可以集成到大多数现有的最先进的方法中。广泛的实验和分析证明了我们的PBN在多个计算机视觉任务中的有效性，包括分类、目标检测、实例检索和语义分割。