Deep Convolutional Neural Networks have made an incredible progress in many Computer Vision tasks. This progress, however, often relies on the availability of large amounts of the training data, required to prevent over-fitting, which in many domains entails significant cost of manual data labeling. An alternative approach is application of data augmentation (DA) techniques that aim at model regularization by creating additional observations from the available ones. This survey focuses on two DA research streams: image mixing and automated selection of augmentation strategies. First, the presented methods are briefly described, and then qualitatively compared with respect to their key characteristics. Various quantitative comparisons are also included based on the results reported in recent DA literature. This review mainly covers the methods published in the materials of top-tier conferences and in leading journals in the years 2017-2021.
翻译:深入进化神经网络在许多计算机愿景任务中取得了令人难以置信的进展,但是,这一进展往往取决于能否获得大量培训数据,而这些数据是防止过度装配所需的,在许多领域需要大量人工数据标签;另一种办法是采用数据增强技术,目的是通过从现有数据中产生更多观察,实现模式正规化;这项调查侧重于两个DA研究流:图像混合和自动选择增强战略;首先,简要描述介绍所提出的方法,然后就其关键特征进行定性比较;根据DA最近文献中报告的结果,还进行了各种定量比较;这一审查主要涉及2017-2021年最高级会议和主要期刊材料中公布的方法。