Deep learning based image enhancement models have largely improved the readability of fundus images in order to decrease the uncertainty of clinical observations and the risk of misdiagnosis. However, due to the difficulty of acquiring paired real fundus images at different qualities, most existing methods have to adopt synthetic image pairs as training data. The domain shift between the synthetic and the real images inevitably hinders the generalization of such models on clinical data. In this work, we propose an end-to-end optimized teacher-student framework to simultaneously conduct image enhancement and domain adaptation. The student network uses synthetic pairs for supervised enhancement, and regularizes the enhancement model to reduce domain-shift by enforcing teacher-student prediction consistency on the real fundus images without relying on enhanced ground-truth. Moreover, we also propose a novel multi-stage multi-attention guided enhancement network (MAGE-Net) as the backbones of our teacher and student network. Our MAGE-Net utilizes multi-stage enhancement module and retinal structure preservation module to progressively integrate the multi-scale features and simultaneously preserve the retinal structures for better fundus image quality enhancement. Comprehensive experiments on both real and synthetic datasets demonstrate that our framework outperforms the baseline approaches. Moreover, our method also benefits the downstream clinical tasks.
翻译:深层学习的图像增强模型在很大程度上提高了Fundus图像的可读性,以减少临床观察的不确定性和误诊的风险。然而,由于难以以不同品质获得配对真实的基金图像,大多数现有方法必须采用合成图像配对作为培训数据。合成图像与真实图像之间的领域变化不可避免地妨碍临床数据模型的普及。在这项工作中,我们提议了一个端至端优化师资-学生框架,以同时进行图像增强和域适应。学生网络使用合成对子来监督强化,并通过在不依赖强化地面图象的情况下对实际基金图像实施师资预测一致性来规范增强模型以降低域档次。此外,我们还提议建立一个新型多阶段多级关注指导增强网络(MAGE-Net),作为我们师生网络的骨干。我们的MAGEG-Net利用多阶段增强模块和视网结构维护模块,以逐步整合多级特征,同时维护视距结构以降低域位,同时通过对实际基金图像进行常规化,同时对真实图像的预测一致性进行规范,而无需依赖强化的地面图象质量。我们还提议一个新的多阶段性模型模型,同时测试。