With the advancement of deep learning techniques, an increasing number of methods have been proposed for optic disc and cup (OD/OC) segmentation from the fundus images. Clinically, OD/OC segmentation is often annotated by multiple clinical experts to mitigate the personal bias. However, it is hard to train the automated deep learning models on multiple labels. A common practice to tackle the issue is majority vote, e.g., taking the average of multiple labels. However such a strategy ignores the different expertness of medical experts. Motivated by the observation that OD/OC segmentation is often used for the glaucoma diagnosis clinically, in this paper, we propose a novel strategy to fuse the multi-rater OD/OC segmentation labels via the glaucoma diagnosis performance. Specifically, we assess the expertness of each rater through an attentive glaucoma diagnosis network. For each rater, its contribution for the diagnosis will be reflected as an expertness map. To ensure the expertness maps are general for different glaucoma diagnosis models, we further propose an Expertness Generator (ExpG) to eliminate the high-frequency components in the optimization process. Based on the obtained expertness maps, the multi-rater labels can be fused as a single ground-truth which we dubbed as Diagnosis First Ground-truth (DiagFirstGT). Experimental results show that by using DiagFirstGT as ground-truth, OD/OC segmentation networks will predict the masks with superior glaucoma diagnosis performance.
翻译:随着深层学习技术的进步,人们为光碟和杯(OD/OC)从基金图象中分离出来提出了越来越多的方法。临床上,多临床专家往往对OD/OC部分进行附加说明,以减轻个人偏见。然而,很难在多个标签上培训自动深层学习模型。解决该问题的通常做法是多数投票,例如采用多标签的平均值。然而,这种战略忽视了医学专家的不同专长。由于观察到OD/OC部分经常用于光谱诊断临床诊断,我们在本文件中提出了一个新的战略,通过光谱诊断性能将多鼠/OC部分标签连接起来。具体地说,我们通过关注的格洛科马诊断网络来评估每个定级器的专长。对于每个定级器而言,其对诊断的贡献将反映在专家性图中。为确保不同光谱诊断模型具有一般性,我们进一步提议一个专家性强的GLOD/OC部分(GODG),通过GLOO级诊断性能分析性能作为高地面平面图,我们用高地面平面图像展示了高地面分析结果。