学习拆分自动比子检测 (Learning to Split for Automatic Bias Detection)

Classifiers are biased when trained on biased datasets. As a remedy, we propose Learning to Split (ls), an algorithm for automatic bias detection. Given a dataset with input-label pairs, ls learns to split this dataset so that predictors trained on the training split generalize poorly to the testing split. This performance gap provides a proxy for measuring the degree of bias in the learned features and can therefore be used to reduce biases. Identifying non-generalizable splits is challenging as we don't have any explicit annotations about how to split. In this work, we show that the prediction correctness of the testing example can be used as a source of weak supervision: generalization performance will drop if we move examples that are predicted correctly away from the testing split, leaving only those that are mispredicted. We evaluate our approach on Beer Review, Waterbirds, CelebA and MNLI. Empirical results show that ls is able to generate astonishingly challenging splits that correlate with human-identified biases. Moreover, we demonstrate that combining robust learning algorithms (such as group DRO) with splits identified by ls enables automatic de-biasing. Compared with previous state-of-the-arts, we substantially improves the worst-group performance (23.4% on average) when the source of biases is unknown during training and validation.

翻译：在有偏差的数据集上培训时,分类者有偏差。作为一种补救措施,我们建议学习分解(ls),这是自动检测偏差的算法。根据一个带有输入标签配对的数据集,我们学会了分割该数据集,这样在培训中受过培训的预测者就会把一般情况分成差到测试分。这个性能差距为测量学习特性中的偏差程度提供了一种替代物,因此可以用来减少偏差。识别非普遍分别具有挑战性,因为我们没有关于如何分解的明确说明。在这项工作中,我们显示测试示例的预测正确性能可以用作监管不力的一个来源:如果我们将预测正确脱离测试分解的示例移走,则一般化性能会下降,只留下那些被错误分解的。我们评估了我们在啤酒审查、水鸟、CelebA和MNLI的方法, 从而可以用来减少偏差。爱心结果显示, 能够产生出惊人的具有挑战性的分差, 与人类识别偏差相关的分法。此外,我们证明, 将稳健的学习算法(如DRO组)与由最差分法确定的分法,在最差分化阶段可以自动地分析中进行,在最差分化的分化中进行。