Recently, change detection (CD) of remote sensing images have achieved great progress with the advances of deep learning. However, current methods generally deliver incomplete CD regions and irregular CD boundaries due to the limited representation ability of the extracted visual features. To relieve these issues, in this work we propose a novel learning framework named Fully Transformer Network (FTN) for remote sensing image CD, which improves the feature extraction from a global view and combines multi-level visual features in a pyramid manner. More specifically, the proposed framework first utilizes the advantages of Transformers in long-range dependency modeling. It can help to learn more discriminative global-level features and obtain complete CD regions. Then, we introduce a pyramid structure to aggregate multi-level visual features from Transformers for feature enhancement. The pyramid structure grafted with a Progressive Attention Module (PAM) can improve the feature representation ability with additional interdependencies through channel attentions. Finally, to better train the framework, we utilize the deeply-supervised learning with multiple boundaryaware loss functions. Extensive experiments demonstrate that our proposed method achieves a new state-of-the-art performance on four public CD benchmarks. For model reproduction, the source code is released at https://github.com/AI-Zhpp/FTN.
翻译:最近,随着深层学习的进步,遥感图像的变化探测(CD)取得了很大进展,但是,由于提取的视觉特征的代表性能力有限,目前的方法一般提供不完全的CD区域和不规则的CD边界。为了解决这些问题,我们在此工作中提议了一个名为“全变器网络”的新学习框架,用于遥感图像CD,改进从全球视野中提取特征,以金字塔方式结合多级视觉特征。更具体地说,拟议的框架首先利用长距离依赖模型中变异器的优势,有助于学习更具有歧视性的全球级特征,并获得完整的CD区域。然后,我们引入一个金字塔结构,将变异器的多层次视觉特征汇总起来,用于增强特征。金字塔结构与一个进步关注模块(PAM)结合,可以通过频道关注来提高特征的表达能力,通过更多的相互依存关系来提高特征。最后,为了更好地培训框架,我们利用了以多重边界认知损失功能来进行高度超超常的学习。广泛的实验表明,我们拟议的方法在四个公共CD/FP基准上实现了新的状态-艺术表现。