Semantic segmentation has a wide array of applications ranging from medical-image analysis, scene understanding, autonomous driving and robotic navigation. This work deals with medical image segmentation and in particular with accurate polyp detection and segmentation during colonoscopy examinations. Several convolutional neural network architectures have been proposed to effectively deal with this task and with the problem of segmenting objects at different scale input. The basic architecture in image segmentation consists of an encoder and a decoder: the first uses convolutional filters to extract features from the image, the second is responsible for generating the final output. In this work, we compare some variant of the DeepLab architecture obtained by varying the decoder backbone. We compare several decoder architectures, including ResNet, Xception, EfficentNet, MobileNet and we perturb their layers by substituting ReLU activation layers with other functions. The resulting methods are used to create deep ensembles which are shown to be very effective. Our experimental evaluations show that our best ensemble produces good segmentation results by achieving high evaluation scores with a dice coefficient of 0.884, and a mean Intersection over Union (mIoU) of 0.818 for the Kvasir-SEG dataset. To improve reproducibility and research efficiency the MATLAB source code used for this research is available at GitHub: https://github.com/LorisNanni.
翻译:语义分解有多种多样的应用,包括医学图像分析、现场理解、自主驾驶和机器人导航等。这项工作涉及医学图像分解,特别是结肠镜检查期间的精确聚点检测和分解。提议了若干进化神经网络结构,以有效处理这项任务和不同规模输入的分解对象问题。图像分解的基本结构包括一个编码器和一个解解码器:首先使用卷式过滤器从图像中提取特征,第二个是生成最终输出的责任。在这项工作中,我们比较了通过不同解码主干线主干柱获得的DeepLab结构的一些变异。我们比较了几个解码结构,包括ResNet、Xeption、EffincentNet、移动网络,以及我们通过将雷卢激活层与其他功能相替换来渗透它们的层。因此,我们用方法来创建显示非常有效的深团团。我们的最佳组合通过在高估评估中得出良好的分解结果,用的是高分数来进行数值评估,用高分解码来提高G0884/LAAT的DNA研究效率,用的是高分解码来改进GIAIS数据库。