Extracting roads from high-resolution remote sensing images (HRSIs) is vital in a wide variety of applications, such as autonomous driving, path planning, and road navigation. Due to the long and thin shape as well as the shades induced by vegetation and buildings, small-sized roads are more difficult to discern. In order to improve the reliability and accuracy of small-sized road extraction when roads of multiple sizes coexist in an HRSI, an enhanced deep neural network model termed Dual-Decoder-U-Net (DDU-Net) is proposed in this paper. Motivated by the U-Net model, a small decoder is added to form a dual-decoder structure for more detailed features. In addition, we introduce the dilated convolution attention module (DCAM) between the encoder and decoders to increase the receptive field as well as to distill multi-scale features through cascading dilated convolution and global average pooling. The convolutional block attention module (CBAM) is also embedded in the parallel dilated convolution and pooling branches to capture more attention-aware features. Extensive experiments are conducted on the Massachusetts Roads dataset with experimental results showing that the proposed model outperforms the state-of-the-art DenseUNet, DeepLabv3+ and D-LinkNet by 6.5%, 3.3%, and 2.1% in the mean Intersection over Union (mIoU), and by 4%, 4.8%, and 3.1% in the F1 score, respectively. Both ablation and heatmap analyses are presented to validate the effectiveness of the proposed model.
翻译:高分辨率遥感图像(HRSI)提取道路对于各种应用(如自主驾驶、路径规划和道路导航)至关重要。由于植被和建筑物诱发的长、薄的形状以及阴影,小型道路更难辨别。为了提高小型道路提取的可靠性和准确性,当多尺寸道路同时存在于一个HRSI时,本文件提议了一个称为双十二-U-Net(DDU-Net)的强化深神经网络模型。在U-Net模型的激励下,增加了一个小型解码器,形成一个双解码器结构,用于更详细的功能。此外,我们引入了编码器和解码器之间的变速关注模块(DCAM),以增加可接收场,并通过电解析变速变速度和全球平均集合来蒸发多级特征。 变速模型显示的变速和合并分数模块(CBAM)也嵌入了平行的变速变速和合并分解器,用于收集更多注意度的DVAV3 和变速解码解码的DR3 。在马萨洛的模型中进行广泛的实验结果,通过Slimal-sal-slational-slationalationalation 4-lation 和Suplevlationaldaldaldaldald daldaldaldaldald dalddaldaldaldald 和Fmaldaldaldaldaldaldaldaldaldaldaldaldaldaldaldaldaldaldd 。