Recently, FCNs have attracted widespread attention in the CD field. In pursuit of better CD performance, it has become a tendency to design deeper and more complicated FCNs, which inevitably brings about huge numbers of parameters and an unbearable computational burden. With the goal of designing a quite deep architecture to obtain more precise CD results while simultaneously decreasing parameter numbers to improve efficiency, in this work, we present a very deep and efficient CD network, entitled EffCDNet. In EffCDNet, to reduce the numerous parameters associated with deep architecture, an efficient convolution consisting of depth-wise convolution and group convolution with a channel shuffle mechanism is introduced to replace standard convolutional layers. In terms of the specific network architecture, EffCDNet does not use mainstream UNet-like architecture, but rather adopts the architecture with a very deep encoder and a lightweight decoder. In the very deep encoder, two very deep siamese streams stacked by efficient convolution first extract two highly representative and informative feature maps from input image-pairs. Subsequently, an efficient ASPP module is designed to capture multi-scale change information. In the lightweight decoder, a recurrent criss-cross self-attention (RCCA) module is applied to efficiently utilize non-local similar feature representations to enhance discriminability for each pixel, thus effectively separating the changed and unchanged regions. Moreover, to tackle the optimization problem in confused pixels, two novel loss functions based on information entropy are presented. On two challenging CD datasets, our approach outperforms other SOTA FCN-based methods, with only benchmark-level parameter numbers and quite low computational overhead.


翻译:最近,FCN在CD领域引起了广泛的关注。在追求更好的CD性能的过程中,设计更深、更复杂的FCN已经成为一种趋势,设计更深、更复杂的FCN,这不可避免地带来大量的参数和难以承受的计算负担。为了设计一个更深的结构以获得更精确的CD结果,同时减少参数数字以提高效率,我们在这项工作中提出了一个非常深而高效的CD网络,名为EffCDNet。在EffCDNet中,为了减少与深层结构相关的众多参数,引入了一种高效的组合,包括深度的共振和配有频道平流机制的组合,以取代标准的平流层。在特定的网络结构中,EffCDNet不使用主流的CD结果来获取更精确的CD结果,而同时使用非常深的编码和轻量的解码来提高效率。在非常深层的CD网络中,两个非常深的平流由高效的连结流首先从输入的图像中提取出两个具有高度代表性和内容丰富的特征图解图。随后,一个高效的 ASPC模块旨在捕捉取多层次的平级的平级平流数据,因此,在两部的平级的平级平级平级平级平级平级平级平级平级平级平级平级平面图。

0
下载
关闭预览

相关内容

CVPR2020 | 商汤-港中文等提出PV-RCNN:3D目标检测新网络
专知会员服务
43+阅读 · 2020年4月17日
【ICLR-2020】网络反卷积,NETWORK DECONVOLUTION
专知会员服务
38+阅读 · 2020年2月21日
《DeepGCNs: Making GCNs Go as Deep as CNNs》
专知会员服务
30+阅读 · 2019年10月17日
[综述]深度学习下的场景文本检测与识别
专知会员服务
77+阅读 · 2019年10月10日
视频理解 S3D,I3D-GCN,SlowFastNet, LFB
极市平台
7+阅读 · 2019年1月31日
disentangled-representation-papers
CreateAMind
26+阅读 · 2018年9月12日
Single-Shot Object Detection with Enriched Semantics
统计学习与视觉计算组
14+阅读 · 2018年8月29日
Relation Networks for Object Detection 论文笔记
统计学习与视觉计算组
16+阅读 · 2018年4月18日
【推荐】全卷积语义分割综述
机器学习研究会
19+阅读 · 2017年8月31日
Auto-Encoding GAN
CreateAMind
7+阅读 · 2017年8月4日
Arxiv
11+阅读 · 2019年4月15日
VIP会员
Top
微信扫码咨询专知VIP会员