Domain generalization (DG) enables generalizing a learning machine from multiple seen source domains to an unseen target one. The general objective of DG methods is to learn semantic representations that are independent of domain labels, which is theoretically sound but empirically challenged due to the complex mixture of common and domain-specific factors. Although disentangling the representations into two disjoint parts has been gaining momentum in DG, the strong presumption over the data limits its efficacy in many real-world scenarios. In this paper, we propose Mix and Reason (\mire), a new DG framework that learns semantic representations via enforcing the structural invariance of semantic topology. \mire\ consists of two key components, namely, Category-aware Data Mixing (CDM) and Adaptive Semantic Topology Refinement (ASTR). CDM mixes two images from different domains in virtue of activation maps generated by two complementary classification losses, making the classifier focus on the representations of semantic objects. ASTR introduces relation graphs to represent semantic topology, which is progressively refined via the interactions between local feature aggregation and global cross-domain relational reasoning. Experiments on multiple DG benchmarks validate the effectiveness and robustness of the proposed \mire.
翻译:DG 方法的总目标是学习独立于域名标签结构的语义表达方式,这种表达方式在理论上是健全的,但由于共同因素和特定领域因素的复杂结合而从经验上受到挑战。虽然将表达方式分解成两个互不相干部分的势头在DG中日益增强,但数据在许多现实世界情景中的效力却因数据被强烈假定而受到限制。在本文中,我们提议了一个新的DG框架,即Mix and reason(\mire),通过执行语义表层结构变异来学习语义表达方式。\mire\ 由两个关键组成部分组成,即:Cel-aware数据混合(CDM)和适应性语义表型再精细化(ASTr),清洁发展机制将两个不同领域的图像混合在一起,这是两个互补分类损失产生的激活地图的结果,使分解器的焦点集中在语义物体的表达方式。ASTRA入代表语义表层学的图表,通过地方地貌整合与全球交叉推理的多重对比关系,正在通过地方地貌整合和拟议DG对比基准之间的交互推理逐步完善。