Data augmentation is an indispensable technique to improve generalization and also to deal with imbalanced datasets. Recently, AutoAugment has been proposed to automatically search augmentation policies from a dataset and has significantly improved performances on many image recognition tasks. However, its search method requires thousands of GPU hours to train even in a reduced setting. In this paper, we propose Fast AutoAugment algorithm that learns augmentation policies using a more efficient search strategy based on density matching. In comparison to AutoAugment, the proposed algorithm speeds up the search time by orders of magnitude while maintaining the comparable performances on the image recognition tasks with various models and datasets including CIFAR-10, CIFAR-100, and ImageNet.
翻译:数据增强是改进一般化和处理不平衡数据集的一个不可或缺的技术。最近,提议自动增强从数据集自动搜索增强政策,并大大改进了许多图像识别任务的性能。然而,其搜索方法要求数千个GPU小时即使在缩小的设置下也要培训。在本文件中,我们提议快速自动增强算法,利用基于密度匹配的更有效的搜索战略来学习增强政策。与自动增强相比,拟议的算法加快了搜索时间,按数量顺序排列,同时保持图像识别任务的类似性能,使用各种模型和数据集,包括CIFAR-10、CIFAR-100和图像网络。