In this paper, we study the task of synthetic-to-real domain generalized semantic segmentation, which aims to learn a model that is robust to unseen real-world scenes using only synthetic data. The large domain shift between synthetic and real-world data, including the limited source environmental variations and the large distribution gap between synthetic and real-world data, significantly hinders the model performance on unseen real-world scenes. In this work, we propose the Style-HAllucinated Dual consistEncy learning (SHADE) framework to handle such domain shift. Specifically, SHADE is constructed based on two consistency constraints, Style Consistency (SC) and Retrospection Consistency (RC). SC enriches the source situations and encourages the model to learn consistent representation across style-diversified samples. RC leverages real-world knowledge to prevent the model from overfitting to synthetic data and thus largely keeps the representation consistent between the synthetic and real-world models. Furthermore, we present a novel style hallucination module (SHM) to generate style-diversified samples that are essential to consistency learning. SHM selects basis styles from the source distribution, enabling the model to dynamically generate diverse and realistic samples during training. Experiments show that our SHADE yields significant improvement and outperforms state-of-the-art methods by 5.05% and 8.35% on the average mIoU of three real-world datasets on single- and multi-source settings, respectively.
翻译:在本文中,我们研究了合成到现实领域通用语义分解的任务,其目的是学习一种对仅使用合成数据的无形真实世界场景十分可靠的模型。合成数据与真实世界数据之间的巨大领域变化,包括来源环境变化有限,合成数据与现实世界数据之间的分布差距很大,这严重阻碍了在隐蔽现实世界场景上的模型性能。在这项工作中,我们提出了处理这种域值变化的SHADE(SHADE)框架。具体地说,SHADE(SHADE)是建立在两个一致性限制基础上的,即“样式一致性(SC)”和“重新回溯镜一致性(RC)”的基础上。SSC丰富了源情况和真实世界数据之间的巨大变化,鼓励了该模型在样式多样化的样本中学习一致性代表一致性。RC利用真实世界知识防止模型过度适应合成数据,从而在很大程度上保持合成和真实世界模式之间的代表性。此外,我们提出了一个新的风格幻觉模块(SHM),以生成对一致性学习至关重要的风格多样化样本。SHM-SHSHS-S-35分别从实际的样式和实验性模型中选择基础的样式,在模型中,使实际源值平均的样本中生成的模型能够产生动态的模型,使动态的模型产生动态的模型和动态模型产生动态的模型。