Colonoscopy, currently the most efficient and recognized colon polyp detection technology, is necessary for early screening and prevention of colorectal cancer. However, due to the varying size and complex morphological features of colonic polyps as well as the indistinct boundary between polyps and mucosa, accurate segmentation of polyps is still challenging. Deep learning has become popular for accurate polyp segmentation tasks with excellent results. However, due to the structure of polyps image and the varying shapes of polyps, it easy for existing deep learning models to overfitting the current dataset. As a result, the model may not process unseen colonoscopy data. To address this, we propose a new State-Of-The-Art model for medical image segmentation, the SSFormer, which uses a pyramid Transformer encoder to improve the generalization ability of models. Specifically, our proposed Progressive Locality Decoder can be adapted to the pyramid Transformer backbone to emphasize local features and restrict attention dispersion. The SSFormer achieves statet-of-the-art performance in both learning and generalization assessment.
翻译:Colonoscopy目前是最高效和公认的结肠聚体检测技术,对早期筛查和预防结肠癌十分必要,但是,由于结肠聚体的大小和复杂形态特征各不相同,加上聚苯醚和粘膜之间的分界界限不同,对聚虫的精确分解仍然具有挑战性。对于精确的聚合分解任务来说,深层次的学习越来越受欢迎,效果极佳。然而,由于聚虫图像的结构以及聚虫的不同形状,现有的深层学习模型很容易过度适应当前的数据集。因此,模型可能不会处理看不见的结肠镜学数据。为此,我们提出了一种新的医学图象分解的“国家-地方-艺术”模型,即“SSFormer”模型,它使用金字塔变形变形器编码器来提高模型的通用能力。具体地,我们拟议的“渐进地方解码器”可以适应金字形变形的骨架,以强调本地特征并限制注意力分散。因此,SFormer公司在学习和一般化评估中都达到“状态”。