Advanced data augmentation strategies have widely been studied to improve the generalization ability of deep learning models. Regional dropout is one of the popular solutions that guides the model to focus on less discriminative parts by randomly removing image regions, resulting in improved regularization. However, such information removal is undesirable. On the other hand, recent strategies suggest to randomly cut and mix patches and their labels among training images, to enjoy the advantages of regional dropout without having any pointless pixel in the augmented images. We argue that such random selection strategies of the patches may not necessarily represent sufficient information about the corresponding object and thereby mixing the labels according to that uninformative patch enables the model to learn unexpected feature representation. Therefore, we propose SaliencyMix that carefully selects a representative image patch with the help of a saliency map and mixes this indicative patch with the target image, thus leading the model to learn more appropriate feature representation. SaliencyMix achieves the best known top-1 error of 21.26% and 20.09% for ResNet-50 and ResNet-101 architectures on ImageNet classification, respectively, and also improves the model robustness against adversarial perturbations. Furthermore, models that are trained with SaliencyMix help to improve the object detection performance. Source code is available at https://github.com/SaliencyMix/SaliencyMix.
翻译:为了提高深层学习模式的普及能力,广泛研究了高级数据增强战略,以提高深层学习模式的普及能力;地区辍学是引导模式通过随机删除图像区域来关注非歧视性部分的流行解决方案之一,从而改进了正规化;然而,这种信息删除是不可取的;另一方面,最近的战略建议随机剪裁和混合培训图像中的补丁及其标签,享受区域辍学的好处,而不会有任何无意义的像素;我们认为,这种补丁随机选择战略不一定代表对相应对象的充分信息,从而根据非信息化补丁将标签混合起来,使模型能够学习出乎意料的特征代表。因此,我们建议萨利尼特Mix在突出的地图的帮助下仔细选择一个有代表性的图像补丁,并将这一指示性补丁与目标图像混在一起,从而引导模型学习更适当的特征代表。萨利特Mix在ResNet-50和ResNet-101的图像网络分类中分别达到21.26%和20.09 %的已知头一误差,从而使得模型能够学习出出出出意的特征。此外,我们还提议在Saltix/Mix 改进目标探测的模型。