Domain shift degrades the performance of object detection models in practical applications. To alleviate the influence of domain shift, plenty of previous work try to decouple and learn the domain-invariant (common) features from source domains via domain adversarial learning (DAL). However, inspired by causal mechanisms, we find that previous methods ignore the implicit insignificant non-causal factors hidden in the common features. This is mainly due to the single-view nature of DAL. In this work, we present an idea to remove non-causal factors from common features by multi-view adversarial training on source domains, because we observe that such insignificant non-causal factors may still be significant in other latent spaces (views) due to the multi-mode structure of data. To summarize, we propose a Multi-view Adversarial Discriminator (MAD) based domain generalization model, consisting of a Spurious Correlations Generator (SCG) that increases the diversity of source domain by random augmentation and a Multi-View Domain Classifier (MVDC) that maps features to multiple latent spaces, such that the non-causal factors are removed and the domain-invariant features are purified. Extensive experiments on six benchmarks show our MAD obtains state-of-the-art performance.
翻译:领域转移会降低实际应用中目标检测模型的性能。为减轻领域转移的影响,先前的大量工作通过领域对抗学习(DAL)从源领域分离学习领域不变(共同)特征。然而,受因果机制的启发,我们发现以前的方法忽略了隐藏在共同特征中的非因果因素的隐式的微不足道的因素。这主要是由于DAL的单视图特性。在此工作中,我们提出了一种通过源领域的多视角对抗训练消除共同特征中的非因果因素的方法。因为我们观察到由于数据的多模结构,这样的微不足道的非因果因素仍然可能在其他潜在空间(视图)中显著。总之,我们提出了一种基于多视角对抗鉴别器(MAD)的领域泛化模型,包括通过随机增强增加源领域多样性的噪声相关性生成器(SCG)和将特征映射到多个潜在空间的多视角域分类器(MVDC),以此去除非因果因素并净化领域不变特征。对六个基准的广泛实验表明,我们的MAD获得了最先进的性能。