Data augmentation has been widely used to improve deep nerual networks performance. Numerous approaches are suggested, for example, dropout, regularization and image augmentation, to avoid over-ftting and enhancing generalization of neural networks. One of the sub-area within data augmentation is image mixing and deleting. This specific type of augmentation either mixes two images or delete image regions to hide or make certain characteristics of images confusing for the network to force it to emphasize on overall structure of object in image. The model trained with this approach has shown to perform and generalize well as compared to one trained without imgage mixing or deleting. Additional benefit achieved with this method of training is robustness against image corruptions. Due to its low compute cost and success in recent past, many techniques of image mixing and deleting are proposed. This paper provides detailed review on these devised approaches, dividing augmentation strategies in three main categories cut and delete, cut and mix and mixup. The second part of paper emprically evaluates these approaches for image classification, finegrained image recognition and object detection where it is shown that this category of data augmentation improves the overall performance for deep neural networks.
翻译:数据增强已被广泛用于改善深层神经网络的性能,例如,提出了许多方法,例如,辍学、正规化和图像增强,以避免过度盗窃和加强神经网络的普遍化,数据增强中的一个子领域是图像混合和删除。这种特定的增强类型混合了两个图像或删除图像区域,使网络隐藏或使图像的某些特征混淆起来,迫使网络强调图像的整体结构。经过培训的模型显示,与受过培训的不混杂或删除的模型相比,能够进行和普及以及比较这些方法。通过这种培训获得的额外好处是稳健地防止图像腐败。由于这种培训方法的成本低,而且最近取得了成功,因此提出了许多图像混合和删除技术。本文详细审查了这些设计的方法,将增强战略分为三大类,切割、删除、切割、混合和混合。文件第二部分对图像分类、微细微图像识别和对象探测方法进行了评价,其中显示,这类数据增强方法改善了深神经网络的总体性能。