Semantic segmentation is a classic computer vision problem dedicated to labeling each pixel with its corresponding category. As a basic task for advanced tasks such as industrial quality inspection, remote sensing information extraction, medical diagnostic aid, and autonomous driving, semantic segmentation has been developed for a long time in combination with deep learning, and a lot of works have been accumulated. However, neither the classic FCN-based works nor the popular Transformer-based works have attained fine-grained localization of pixel labels, which remains the main challenge in this field. Recently, with the popularity of autonomous driving, the segmentation of road scenes has received increasing attention. Based on the cross-task consistency theory, we incorporate edge priors into semantic segmentation tasks to obtain better results. The main contribution is that we provide a model-agnostic method that improves the accuracy of semantic segmentation models with zero extra inference runtime overhead, verified on the datasets of road and non-road scenes. From our experimental results, our method can effectively improve semantic segmentation accuracy.
翻译:语义分解是一个典型的计算机视觉问题,专门用来给每个像素贴上相应的类别。作为工业质量检查、遥感信息提取、医疗诊断协助和自主驾驶等高级任务的基本任务,长期以来,在深层学习的同时,还开发了语义分解,并积累了大量工作。然而,经典的FCN工程和流行的变异器工程都没有达到像素标签的精密本地化,这仍然是该领域的主要挑战。最近,随着自主驾驶的普及,路景的分解受到越来越多的关注。根据跨任务一致性理论,我们把边缘前科纳入语义分解任务,以获得更好的结果。主要贡献是,我们提供了一种模型-通性方法,用零额外引力运行的间接费用提高语义分解模型的准确性,并在道路和非路景的数据集上进行验证。根据我们的实验结果,我们的方法可以有效地提高语义分解的准确性。