Vision-based semantic segmentation of waterbodies and nearby related objects provides important information for managing water resources and handling flooding emergency. However, the lack of large-scale labeled training and testing datasets for water-related categories prevents researchers from studying water-related issues in the computer vision field. To tackle this problem, we present ATLANTIS, a new benchmark for semantic segmentation of waterbodies and related objects. ATLANTIS consists of 5,195 images of waterbodies, as well as high quality pixel-level manual annotations of 56 classes of objects, including 17 classes of man-made objects, 18 classes of natural objects and 21 general classes. We analyze ATLANTIS in detail and evaluate several state-of-the-art semantic segmentation networks on our benchmark. In addition, a novel deep neural network, AQUANet, is developed for waterbody semantic segmentation by processing the aquatic and non-aquatic regions in two different paths. AQUANet also incorporates low-level feature modulation and cross-path modulation for enhancing feature representation. Experimental results show that the proposed AQUANet outperforms other state-of-the-art semantic segmentation networks on ATLANTIS. We claim that ATLANTIS is the largest waterbody image dataset for semantic segmentation providing a wide range of water and water-related classes and it will benefit researchers of both computer vision and water resources engineering.
翻译:水体和附近相关物体的基于视觉的语义分解为管理水资源和处理水灾紧急情况提供了重要信息,然而,由于缺少与水有关的类别所需的大规模标记培训和测试数据集,研究人员无法在计算机的视觉领域研究与水有关的问题。为解决这一问题,我们提出了ATLANTIS,这是水体和相关物体的语义分解的新基准。ATLANTIS由5 195个水体图像组成,以及高质量的像素级人工说明,共56类物体,包括17个人造物体班、18个自然物体班和21个普通班。我们详细分析ATLANTIS,并评估我们基准上的若干最先进的语义分解网络。此外,我们还开发了一个新的深神经网络,AQUANet,通过在两个不同路径上处理水体和非水体区域。AQUANet还包含低级别地貌调和交叉路调制,以加强地貌表现。我们详细分析的ATLANTIS, 实验结果显示,在我们的基准上,AQANA-LAAAA部分中,我们提出了有关其区域图系的亚域图部分。