Despite impressive success in many tasks, deep learning models are shown to rely on spurious features, which will catastrophically fail when generalized to out-of-distribution (OOD) data. Invariant Risk Minimization (IRM) is proposed to alleviate this issue by extracting domain-invariant features for OOD generalization. Nevertheless, recent work shows that IRM is only effective for a certain type of distribution shift (e.g., correlation shift) while it fails for other cases (e.g., diversity shift). Meanwhile, another thread of method, Adversarial Training (AT), has shown better domain transfer performance, suggesting that it has the potential to be an effective candidate for extracting domain-invariant features. This paper investigates this possibility by exploring the similarity between the IRM and AT objectives. Inspired by this connection, we propose Domainwise Adversarial Training (DAT), an AT-inspired method for alleviating distribution shift by domain-specific perturbations. Extensive experiments show that our proposed DAT can effectively remove domain-varying features and improve OOD generalization under both correlation shift and diversity shift.
翻译:尽管在许多任务中取得了令人印象深刻的成功,但深层次的学习模式却被证明依赖于虚假的特征,这些特征如果被概括到分布范围以外的数据,就会灾难性地失败。建议通过为OOOD的概括化提取域差异性特征来缓解这一问题。然而,最近的工作表明,IMR仅仅对某种类型的分布转移(例如相关变化)有效,而对其他情况(例如多样性变化)则不起作用。与此同时,另一串方法,即反向培训(AT),已经表现出更好的域域转移性能,表明它有可能成为提取域差异性特征的有效候选方。本文探讨了这一可能性,探讨了IMM目标与AT目标之间的相似性。受此联系的启发,我们建议DOD培训(DAT)是一种受AT启发的方法,通过特定域的扰动性变化减轻分布转移。广泛的实验表明,我们提议的DAT可在相关变化和多样性变化中有效地去除域对域的特征和改进OODG的概括化。