走向深入和高效:在VHR图像中,一个充分高效的革命网络,以探测变化。 (Towards Deep and Efficient: A Deep Siamese Self-Attention Fully Efficient Convolutional Network for Change Detection in VHR Images)

Recently, FCNs have attracted widespread attention in the CD field. In pursuit of better CD performance, it has become a tendency to design deeper and more complicated FCNs, which inevitably brings about huge numbers of parameters and an unbearable computational burden. With the goal of designing a quite deep architecture to obtain more precise CD results while simultaneously decreasing parameter numbers to improve efficiency, in this work, we present a very deep and efficient CD network, entitled EffCDNet. In EffCDNet, to reduce the numerous parameters associated with deep architecture, an efficient convolution consisting of depth-wise convolution and group convolution with a channel shuffle mechanism is introduced to replace standard convolutional layers. In terms of the specific network architecture, EffCDNet does not use mainstream UNet-like architecture, but rather adopts the architecture with a very deep encoder and a lightweight decoder. In the very deep encoder, two very deep siamese streams stacked by efficient convolution first extract two highly representative and informative feature maps from input image-pairs. Subsequently, an efficient ASPP module is designed to capture multi-scale change information. In the lightweight decoder, a recurrent criss-cross self-attention (RCCA) module is applied to efficiently utilize non-local similar feature representations to enhance discriminability for each pixel, thus effectively separating the changed and unchanged regions. Moreover, to tackle the optimization problem in confused pixels, two novel loss functions based on information entropy are presented. On two challenging CD datasets, our approach outperforms other SOTA FCN-based methods, with only benchmark-level parameter numbers and quite low computational overhead.

翻译：最近,FCN在CD领域引起了广泛的关注。在追求更好的CD性能的过程中,设计更深、更复杂的FCN已经成为一种趋势,设计更深、更复杂的FCN,这不可避免地带来大量的参数和难以承受的计算负担。为了设计一个更深的结构以获得更精确的CD结果,同时减少参数数字以提高效率,我们在这项工作中提出了一个非常深而高效的CD网络,名为EffCDNet。在EffCDNet中,为了减少与深层结构相关的众多参数,引入了一种高效的组合,包括深度的共振和配有频道平流机制的组合,以取代标准的平流层。在特定的网络结构中,EffCDNet不使用主流的CD结果来获取更精确的CD结果,而同时使用非常深的编码和轻量的解码来提高效率。在非常深层的CD网络中,两个非常深的平流由高效的连结流首先从输入的图像中提取出两个具有高度代表性和内容丰富的特征图解图。随后,一个高效的 ASPC模块旨在捕捉取多层次的平级的平级平流数据,因此,在两部的平级的平级平级平级平级平级平级平级平级平级平级平级平级平面图。