Medical image segmentation is routinely performed to isolate regions of interest, such as organs and lesions. Currently, deep learning is the state of the art for automatic segmentation, but is usually limited by the need for supervised training with large datasets that have been manually segmented by trained clinicians. The goal of semi-superised and unsupervised image segmentation is to greatly reduce, or even eliminate, the need for training data and therefore to minimze the burden on clinicians when training segmentation models. To this end we introduce a novel network architecture for capable of unsupervised and semi-supervised image segmentation called TricycleGAN. This approach uses three generative models to learn translations between medical images and segmentation maps using edge maps as an intermediate step. Distinct from other approaches based on generative networks, TricycleGAN relies on shape priors rather than colour and texture priors. As such, it is particularly well-suited for several domains of medical imaging, such as ultrasound imaging, where commonly used visual cues may be absent. We present experiments with TricycleGAN on a clinical dataset of kidney ultrasound images and the benchmark ISIC 2018 skin lesion dataset.
翻译:目前,深层次的学习是自动分解的先进技术,但通常受到以下因素的限制:需要以受过训练的临床医生人工分解的大型数据集进行监督培训。半封闭和未经监督的图像分解的目标是大大减少或甚至消除培训数据的需求,从而在培训分解模型时缩小临床医生的负担。为此,我们引入了一个新型网络结构,以能够进行不受监督和半监督的图像分解,称为TricyclyGAN。这个方法使用三种基因化模型学习医疗图像和分解图之间的翻译,作为中间步骤。不同于基于基因化网络的其他方法,TricyclyGAN依靠形状的前身而不是颜色和纹理前身。因此,它特别适合医学成像的若干领域,例如超声成像,在那里可能缺乏常用的视觉提示。我们用TriccyclyGAN进行肾脏超声波图像临床数据集实验,并基准ISISB18数据库。