By inferring latent groups in the training data, recent works introduce invariant learning to the case where environment annotations are unavailable. Typically, learning group invariance under a majority/minority split is empirically shown to be effective in improving out-of-distribution generalization on many datasets. However, theoretical guarantee for these methods on learning invariant mechanisms is lacking. In this paper, we reveal the insufficiency of existing group invariant learning methods in preventing classifiers from depending on spurious correlations in the training set. Specifically, we propose two criteria on judging such sufficiency. Theoretically and empirically, we show that existing methods can violate both criteria and thus fail in generalizing to spurious correlation shifts. Motivated by this, we design a new group invariant learning method, which constructs groups with statistical independence tests, and reweights samples by group label proportion to meet the criteria. Experiments on both synthetic and real data demonstrate that the new method significantly outperforms existing group invariant learning methods in generalizing to spurious correlation shifts.
翻译:通过在培训数据中推断潜在群体,最近的工作在环境说明无法提供的情况下引入了不变化的学习。通常,多数/少数群体差异下的学习群体差异在经验上表明,在改进许多数据集的分布外概括方面是有效的。然而,在学习差异机制方面缺乏对这些方法的理论保障。在本文件中,我们揭示了现有群体差异学习方法不足以防止分类者依赖成套培训中的虚假相关性。具体地说,我们提出了判断这种充分性的两个标准。从理论上和从经验上看,我们表明现有方法可能违反两个标准,从而无法普遍采用虚假的相关性变化。我们为此设计了新的差异学习方法,通过统计独立测试和按群体标签比例对样本进行再加权,以达到标准。关于合成和真实数据的实验表明,新的方法大大优于现有群体差异学习方法,从而普遍地形成了虚假的相关性变化。