Image segmentation is important in medical imaging, providing valuable, quantitative information for clinical decision-making in diagnosis, therapy, and intervention. The state-of-the-art in automated segmentation remains supervised learning, employing discriminative models such as U-Net. However, training these models requires access to large amounts of manually labelled data which is often difficult to obtain in real medical applications. In such settings, semi-supervised learning (SSL) attempts to leverage the abundance of unlabelled data to obtain more robust and reliable models. Recently, generative models have been proposed for semantic segmentation, as they make an attractive choice for SSL. Their ability to capture the joint distribution over input images and output label maps provides a natural way to incorporate information from unlabelled images. This paper analyses whether deep generative models such as the SemanticGAN are truly viable alternatives to tackle challenging medical image segmentation problems. To that end, we thoroughly evaluate the segmentation performance, robustness, and potential subgroup disparities of discriminative and generative segmentation methods when applied to large-scale, publicly available chest X-ray datasets.
翻译:在医学成像中,图像分割很重要,为诊断、治疗和干预方面的临床决策提供了宝贵的定量信息。自动化分割方面的先进技术仍然是监督性学习,使用U-Net等歧视性模型。然而,培训这些模型需要获取大量手工贴标签的数据,而这些数据在真正的医疗应用中往往难以获得。在这种环境中,半监督学习试图利用大量未贴标签的数据获取更可靠可靠的模型。最近,为语义分割提出了基因分割模型,因为它们为SSL提供了一种吸引人的选择。它们捕捉输入图像和输出标签图的联合分布的能力为吸收来自未贴标签图像的信息提供了自然的途径。本文分析了诸如SermanticGAN等深层基因化模型是否真正可行,是解决挑战医学图像分割问题的替代方法。为此,我们彻底评估了在大规模、公开提供的胸X光数据集中应用的分解性性功能、稳健性和潜在分化方法的分组差异。