Glaucoma is a chronic visual disease that may cause permanent irreversible blindness. Measurement of the cup-to-disc ratio (CDR) plays a pivotal role in the detection of glaucoma in its early stage, preventing visual disparities. Therefore, accurate and automatic segmentation of optic disc (OD) and optic cup (OC) from retinal fundus images is a fundamental requirement. Existing CNN-based segmentation frameworks resort to building deep encoders with aggressive downsampling layers, which suffer from a general limitation on modeling explicit long-range dependency. To this end, in this paper, we propose a new segmentation pipeline, called UT-Net, availing the advantages of U-Net and transformer both in its encoding layer, followed by an attention-gated bilinear fusion scheme. In addition to this, we incorporate Multi-Head Contextual attention to enhance the regular self-attention used in traditional vision transformers. Thus low-level features along with global dependencies are captured in a shallow manner. Besides, we extract context information at multiple encoding layers for better exploration of receptive fields, and to aid the model to learn deep hierarchical representations. Finally, an enhanced mixing loss is proposed to tightly supervise the overall learning process. The proposed model has been implemented for joint OD and OC segmentation on three publicly available datasets: DRISHTI-GS, RIM-ONE R3, and REFUGE. Additionally, to validate our proposal, we have performed exhaustive experimentation on Glaucoma detection from all three datasets by measuring the Cup to Disc Ratio (CDR) value. Experimental results demonstrate the superiority of UT-Net as compared to the state-of-the-art methods.
翻译:Glaucoma是一种慢性视觉疾病,可能会导致永久性的不可逆转失明。测量杯点与盘点比率(CDR)在早期检测青光眼中发挥着关键作用,防止视觉差异。因此,光碟和光杯(OC)的准确和自动分解是基本要求,基于CNN的现有分解框架可被用于建造深度的分解器,具有进取性下游层,在建模明显的远程依赖性方面受到普遍限制。为此,我们在本文件中提出一个新的分解管道,称为UT-Net,在其初始阶段使用U-Net和变异器的优势,防止视觉差异。因此,光碟盘盘盘和光杯(OC)的准确和自动分解是基本要求的。此外,我们把多位背景关注点放在提高传统视觉变异像器中的正常自留状态上,从而以浅薄的方式捕捉到与全球依赖性关系。 此外,我们从多个编码层收集了信息,以更好地探索可接受的域,用UT-Net(UNet)网络和变压器的优势优势,在公共层层中利用U-Net和变动的ULODRDR3 演示结果。最后,提议采用三代数据演示的模型,以学习现有数据。拟议进行升级数据演示。拟议进行升级数据演示。最新数据,以学习。最后进行升级数据演示,以学习。</s>