Modern deep neural networks suffer from performance degradation when evaluated on testing data under different distributions from training data. Domain generalization aims at tackling this problem by learning transferable knowledge from multiple source domains in order to generalize to unseen target domains. This paper introduces a novel Fourier-based perspective for domain generalization. The main assumption is that the Fourier phase information contains high-level semantics and is not easily affected by domain shifts. To force the model to capture phase information, we develop a novel Fourier-based data augmentation strategy called amplitude mix which linearly interpolates between the amplitude spectrums of two images. A dual-formed consistency loss called co-teacher regularization is further introduced between the predictions induced from original and augmented images. Extensive experiments on three benchmarks have demonstrated that the proposed method is able to achieve state-of-the-arts performance for domain generalization.
翻译:现代深神经网络在根据培训数据不同分布的测试数据进行评估时,会受到性能退化的影响; 广域化的目的是解决这一问题,从多个来源领域学习可转移的知识,以便向看不见的目标领域推广; 本文介绍了一个新的基于Fourier的视角,用于对领域进行概括化; 主要假设是,Fourier阶段的信息包含高层次的语义学,并不容易受到域变换的影响; 为了迫使模型捕捉阶段信息,我们开发了一个新型的基于Fourier的数据增强战略,称为“振幅组合”,在两种图像的振幅频谱之间进行线性间插。 在原始图像和扩展图像的预测之间,进一步引入了被称为共同教师正规化的双重一致性损失。 对三个基准的广泛实验表明,拟议的方法能够实现域化的状态性能。