The Mixup scheme suggests mixing a pair of samples to create an augmented training sample and has gained considerable attention recently for improving the generalizability of neural networks. A straightforward and widely used extension of Mixup is to combine with regional dropout-like methods: removing random patches from a sample and replacing it with the features from another sample. Albeit their simplicity and effectiveness, these methods are prone to create harmful samples due to their randomness. To address this issue, 'maximum saliency' strategies were recently proposed: they select only the most informative features to prevent such a phenomenon. However, they now suffer from lack of sample diversification as they always deterministically select regions with maximum saliency, injecting bias into the augmented data. In this paper, we present, a novel, yet simple Mixup-variant that captures the best of both worlds. Our idea is two-fold. By stochastically sampling the features and 'grafting' them onto another sample, our method effectively generates diverse yet meaningful samples. Its second ingredient is to produce the label of the grafted sample by mixing the labels in a saliency-calibrated fashion, which rectifies supervision misguidance introduced by the random sampling procedure. Our experiments under CIFAR, Tiny-ImageNet, and ImageNet datasets show that our scheme outperforms the current state-of-the-art augmentation strategies not only in terms of classification accuracy, but is also superior in coping under stress conditions such as data corruption and object occlusion.
翻译:混合法建议混合一对样本,以扩大培训样本,最近人们相当关注改善神经网络的通用性。混合法的一个简单而广泛使用的延伸是结合区域类似辍学的方法:从样本中去除随机的补丁,用另一个样本的特征取而代之。尽管这些方法简单而有效,但由于随机性,很容易产生有害的样本。为了解决这个问题,最近提出了“最突出”的战略:它们只选择最丰富的特征来防止这种现象。然而,它们现在缺乏样本多样化,因为它们总是以最明显的方式选择具有最高清晰度的区域,将偏向性注入扩大的数据。在本文件中,我们展示了一种新颖的、但简单的混合法变异法,捕捉到两个世界的最佳特征。我们的想法是双重的。通过对特征进行随机抽样和“把它们移植到另一个样本”,我们的方法有效地生成了多样化但有意义的样本。它的第二个成分是产生粘固的样本标签,通过在精确性精准的精确度中混合标签,将精准性区域,将偏向偏向数据输入。在随机性模型中,通过我们的图像模型化的模型化方法,通过我们的精确性测试,通过我们的随机性模型将数据显示我们的精确性模型的模型的模型的模型,将数据显示显示我们的数据显示。