The task of instance segmentation in remote sensing images, aiming at performing per-pixel labeling of objects at instance level, is of great importance for various civil applications. Despite previous successes, most existing instance segmentation methods designed for natural images encounter sharp performance degradations when directly applied to top-view remote sensing images. Through careful analysis, we observe that the challenges mainly come from lack of discriminative object features due to severe scale variations, low contrasts, and clustered distributions. In order to address these problems, a novel context aggregation network (CATNet) is proposed to improve the feature extraction process. The proposed model exploits three lightweight plug-and-play modules, namely dense feature pyramid network (DenseFPN), spatial context pyramid (SCP), and hierarchical region of interest extractor (HRoIE), to aggregate global visual context at feature, spatial, and instance domains, respectively. DenseFPN is a multi-scale feature propagation module that establishes more flexible information flows by adopting inter-level residual connections, cross-level dense connections, and feature re-weighting strategy. Leveraging the attention mechanism, SCP further augments the features by aggregating global spatial context into local regions. For each instance, HRoIE adaptively generates RoI features for different downstream tasks. We carry out extensive evaluation of the proposed scheme on the challenging iSAID, DIOR, NWPU VHR-10, and HRSID datasets. The evaluation results demonstrate that the proposed approach outperforms state-of-the-arts with similar computational costs. Code is available at https://github.com/yeliudev/CATNet.
翻译:尽管以往取得了成功,但为自然图像设计的多数现有实例分解方法在直接应用到高视遥感图像时会遇到急剧的性能退化。通过仔细分析,我们观察到,挑战主要来自由于规模差异大、对比低和集群分布而缺乏歧视性物体特征。为了解决这些问题,提议建立一个新的背景汇总网络(CATNet)来改进特征提取过程。提议的模型利用了三种轻量插子模块,即密集的类似计算功能金字塔网络(DESFPN)、空间环境金字塔(SCP)和利益提取器的等级区域(HRIE),分别用于综合地貌、空间和实例区域的全球视觉环境。DenseFPN是一个多尺度的地貌传播模块,通过采用不同级别的残余连接、跨层次的密度连接和特征再加权战略来建立更灵活的信息流动。 利用关注机制,SCP-HRI-ID 进一步增强功能特征,通过将全球空间-S的适应性数据系统生成到地方区域。