In the domain generalization literature, a common objective is to learn representations independent of the domain after conditioning on the class label. We show that this objective is not sufficient: there exist counter-examples where a model fails to generalize to unseen domains even after satisfying class-conditional domain invariance. We formalize this observation through a structural causal model and show the importance of modeling within-class variations for generalization. Specifically, classes contain objects that characterize specific causal features, and domains can be interpreted as interventions on these objects that change non-causal features. We highlight an alternative condition: inputs across domains should have the same representation if they are derived from the same object. Based on this objective, we propose matching-based algorithms when base objects are observed (e.g., through data augmentation) and approximate the objective when objects are not observed (MatchDG). Our simple matching-based algorithms are competitive to prior work on out-of-domain accuracy for rotated MNIST, Fashion-MNIST, PACS, and Chest-Xray datasets. Our method MatchDG also recovers ground-truth object matches: on MNIST and Fashion-MNIST, top-10 matches from MatchDG have over 50% overlap with ground-truth matches.
翻译:在一般化领域文献中,一个共同的目标是在对类标签进行调整后,学习独立于域外的表示。我们显示,这个目标并不足够:存在反示例:模型即使满足了等级条件域的偏差,也未能向隐蔽域推广。我们通过结构性因果模型将这一观察正式化,并表明类内差异模型对于概括化的重要性。具体地说,分类中包含特定因果特性特征的物体,而域可以被解释为对改变非因果特性的这些对象的干预。我们强调一个替代条件:如果来自同一对象,则跨域的投入应有相同的表示。基于此目标,我们提议在观测基对象时(例如通过数据增强)以匹配法为依据的算法,并在未观测对象时接近目标(MatchDG)。我们简单的匹配算法与先前关于旋转的MNIST、Fshion-MIT、PACS和Chest-Xray数据集的外部精确性工作相比,我们的方法MCTDG也从地面-ruth天体-MDG-MMAMA 上恢复了顶级的相匹配。