Semantic segmentation of Very High Resolution (VHR) remote sensing images is a fundamental task for many applications. However, large variations in the scales of objects in those VHR images pose a challenge for performing accurate semantic segmentation. Existing semantic segmentation networks are able to analyse an input image at up to four resizing scales, but this may be insufficient given the diversity of object scales. Therefore, Multi Scale (MS) test-time data augmentation is often used in practice to obtain more accurate segmentation results, which makes equal use of the segmentation results obtained at the different resizing scales. However, it was found in this study that different classes of objects had their preferred resizing scale for more accurate semantic segmentation. Based on this behaviour, a Stacking-Based Semantic Segmentation (SBSS) framework is proposed to improve the segmentation results by learning this behaviour, which contains a learnable Error Correction Module (ECM) for segmentation result fusion and an Error Correction Scheme (ECS) for computational complexity control. Two ECS, i.e., ECS-MS and ECS-SS, are proposed and investigated in this study. The Floating-point operations (Flops) required for ECS-MS and ECS-SS are similar to the commonly used MS test and the Single-Scale (SS) test, respectively. Extensive experiments on four datasets (i.e., Cityscapes, UAVid, LoveDA and Potsdam) show that SBSS is an effective and flexible framework. It achieved higher accuracy than MS when using ECS-MS, and similar accuracy as SS with a quarter of the memory footprint when using ECS-SS.
翻译:甚高分辨率(VHR)遥感图像的语义分解是许多应用的基本任务。然而,甚高分辨率(VHR)图像中物体的大小差异很大,对准确的语义分解构成挑战。现有的语义分解网络能够分析最多四个调整比例的输入图像,但鉴于物体比例的多样性,这一点可能不够充分。因此,多比例(MS)测试时间数据增强在实践中常常用于获取更准确的分解结果,这使得不同调整规模中取得的分解结果得到同等利用。然而,这项研究发现,不同类别的物体在使用更精确的语义分解分解分解分解法时,更倾向于调整其精度比例。基于此行为,提议采用一个基于Stagging(S)的语义分解结构分解图解图,用于计算复杂性控制的多比例(ECS-MS(ECS-MS)和ECS-SS(ECS-S-S-S-IS)的精度缩略度缩略度框架(ECS-S-S-S-Slational-S)的缩略图,用于S-S-S-S-Servial-ex-LUD-S-S-S-S-LUD-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-Serent-S-S-S-S-S-S-S-S-S-S-S-S-S-SLismOL-S-S-S-S-S-S-Servient-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-I-SLismal-SL-S-SL)和S-I-I-I-S-I-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-S-