以 Entropic Aversariarial 数据增强的展望比亚斯以外 (Look Beyond Bias with Entropic Adversarial Data Augmentation)

Deep neural networks do not discriminate between spurious and causal patterns, and will only learn the most predictive ones while ignoring the others. This shortcut learning behaviour is detrimental to a network's ability to generalize to an unknown test-time distribution in which the spurious correlations do not hold anymore. Debiasing methods were developed to make networks robust to such spurious biases but require to know in advance if a dataset is biased and make heavy use of minority counterexamples that do not display the majority bias of their class. In this paper, we argue that such samples should not be necessarily needed because the ''hidden'' causal information is often also contained in biased images. To study this idea, we propose 3 publicly released synthetic classification benchmarks, exhibiting predictive classification shortcuts, each of a different and challenging nature, without any minority samples acting as counterexamples. First, we investigate the effectiveness of several state-of-the-art strategies on our benchmarks and show that they do not yield satisfying results on them. Then, we propose an architecture able to succeed on our benchmarks, despite their unusual properties, using an entropic adversarial data augmentation training scheme. An encoder-decoder architecture is tasked to produce images that are not recognized by a classifier, by maximizing the conditional entropy of its outputs, and keep as much as possible of the initial content. A precise control of the information destroyed, via a disentangling process, enables us to remove the shortcut and leave everything else intact. Furthermore, results competitive with the state-of-the-art on the BAR dataset ensure the applicability of our method in real-life situations.

翻译：深心神经网络并不区分虚假和因果关系模式, 并且只会在忽略其他模式的同时学习最有预测性的模型。这种捷径学习行为会损害一个网络推广未知的测试时间分布的能力, 其间虚假的相关性不再存在。开发了贬低性方法, 使网络变得强大, 以这种虚假的偏差, 但需要事先知道数据集是否有偏差, 并大量使用不显示本类多数偏差的少数反抽样。在本文中, 我们争辩说, 这些样本不一定需要, 因为“ 隐藏” 的因果关系信息通常也包含在偏差图像中。为了研究这个想法, 我们建议公开发布3个合成分类基准, 展示预测性分类快捷点, 每一个性质不同而没有少数的样本作为反证。首先, 我们调查了我们基准上几个最先进的战略的有效性, 并表明它们并没有产生令人满意的结果。然后, 我们提出一个能够实现我们基准基准的架构, 尽管这些“ 隐藏的” 的“ 错觉”, 也使得我们的所有错误的错误信息都包含有偏差的结果。我们建议3个综合的分类基准,, 使用一个通过一个最精确的模型, 通过一个被确认的模型, 通过一个被破坏的模型, 的方法, 一个固定化的方法, 一个固定的系统, 一个固定化的系统, 将它作为固定化的系统, 一个固定的系统, 一个固定的系统, 一个固定的系统, 一个固定的系统, 一个固定的系统, 一个固定的系统, 一个固定的系统, 。