Medical image segmentation is the technique that helps doctor view and has a precise diagnosis, particularly in Colorectal Cancer. Specifically, with the increase in cases, the diagnosis and identification need to be faster and more accurate for many patients; in endoscopic images, the segmentation task has been vital to helping the doctor identify the position of the polyps or the ache in the system correctly. As a result, many efforts have been made to apply deep learning to automate polyp segmentation, mostly to ameliorate the U-shape structure. However, the simple skip connection scheme in UNet leads to deficient context information and the semantic gap between feature maps from the encoder and decoder. To deal with this problem, we propose a novel framework composed of ConvNeXt backbone and Multi Kernel Positional Embedding block. Thanks to the suggested module, our method can attain better accuracy and generalization in the polyps segmentation task. Extensive experiments show that our model achieves the Dice coefficient of 0.8818 and the IOU score of 0.8163 on the Kvasir-SEG dataset. Furthermore, on various datasets, we make competitive achievement results with other previous state-of-the-art methods.
翻译:医疗图象分解是有助于医生观察和精确诊断的技术,特别是在直肠癌中。具体地说,随着病例的增加,诊断和鉴定需要更快,对许多病人来说更加准确;在内骨图中,分解任务对于帮助医生正确识别聚苯醚或系统骨骼的位置至关重要,因此,已作出许多努力,将深层次学习应用于自动聚变分解,主要是为了改善U形结构。然而,UNet的简单跳过连接方案导致背景信息不足,而且编码器和解密器的特征图之间出现语义差距。为了解决这一问题,我们提议了一个由ConvNeXt脊柱和多核心定位嵌入块组成的新框架。由于采用了所建议的模块,我们的方法可以在聚谱分解任务中实现更准确和概括化。广泛的实验表明,我们的模型在Kvasir-SEG数据集上实现了0.8818的Dice系数和0.8163 IOU分。此外,在各种数据设置方面,我们与其他先前的州-州-州-州-州-州-州数据集上取得了竞争性的成果。