Ultra-high resolution image segmentation has raised increasing interests in recent years due to its realistic applications. In this paper, we innovate the widely used high-resolution image segmentation pipeline, in which an ultra-high resolution image is partitioned into regular patches for local segmentation and then the local results are merged into a high-resolution semantic mask. In particular, we introduce a novel locality-aware context fusion based segmentation model to process local patches, where the relevance between local patch and its various contexts are jointly and complementarily utilized to handle the semantic regions with large variations. Additionally, we present the alternating local enhancement module that restricts the negative impact of redundant information introduced from the contexts, and thus is endowed with the ability of fixing the locality-aware features to produce refined results. Furthermore, in comprehensive experiments, we demonstrate that our model outperforms other state-of-the-art methods in public benchmarks. Our released codes are available at: https://github.com/liqiokkk/FCtL.
翻译:近年来,超高分辨率图像分解由于其现实应用,引起了越来越多的兴趣。在本文中,我们创新了广泛使用的高分辨率图像分解管道,将超高分辨率图像分解成局部分解的常规补丁,然后将当地结果合并成高分辨率语义遮罩。特别是,我们引入了一个基于地貌认知的背景分解新颖模式,用于处理局部分解,在这种模式中,本地补丁及其各种背景的相关性被联合和补充地用于处理语义区域,且差异很大。此外,我们介绍了可互换的本地强化模块,该模块限制从环境中引入的多余信息的负面影响,从而赋予了确定地貌特征以产生精细结果的能力。此外,在全面实验中,我们证明我们的模型超越了公共基准中的其他状态方法。我们发布的代码可以在https://github.com/liqiokk/FCTL上查阅。