Deep learning has achieved remarkable results in many computer vision tasks. Deep neural networks typically rely on large amounts of training data to avoid overfitting. However, labeled data for real-world applications may be limited. By improving the quantity and diversity of training data, data augmentation has become an inevitable part of deep learning model training with image data. As an effective way to improve the sufficiency and diversity of training data, data augmentation has become a necessary part of successful application of deep learning models on image data. In this paper, we systematically review different image data augmentation methods. We propose a taxonomy of reviewed methods and present the strengths and limitations of these methods. We also conduct extensive experiments with various data augmentation methods on three typical computer vision tasks, including semantic segmentation, image classification and object detection. Finally, we discuss current challenges faced by data augmentation and future research directions to put forward some useful research guidance.
翻译:深入学习在许多计算机愿景任务中取得了显著成果。深神经网络通常依靠大量培训数据来避免过度匹配。然而,用于真实世界应用的标签数据可能有限。通过提高培训数据的数量和多样性,数据扩增已成为带有图像数据的深层次学习模式培训的一个不可避免的部分。作为提高培训数据充足性和多样性的有效途径,数据扩增已成为成功应用关于图像数据的深层学习模型的必要组成部分。在本文中,我们系统地审查不同的图像数据扩增方法。我们建议对经过审查的方法进行分类,并展示这些方法的长处和局限性。我们还就三种典型的计算机愿景任务,包括语义分解、图像分类和物体探测,对各种数据扩增能力方法进行了广泛的实验。最后,我们讨论了数据扩增和今后研究方向面临的挑战,以提出一些有用的研究指导。