Convolutional Neural Networks (CNNs) have shown to be powerful medical image segmentation models. In this study, we address some of the main unresolved issues regarding these models. Specifically, training of these models on small medical image datasets is still challenging, with many studies promoting techniques such as transfer learning. Moreover, these models are infamous for producing over-confident predictions and for failing silently when presented with out-of-distribution (OOD) data at test time. In this paper, we advocate for multi-task learning, i.e., training a single model on several different datasets, spanning several different organs of interest and different imaging modalities. We show that not only a single CNN learns to automatically recognize the context and accurately segment the organ of interest in each context, but also that such a joint model often has more accurate and better-calibrated predictions than dedicated models trained separately on each dataset. Our experiments show that multi-task learning can outperform transfer learning in medical image segmentation tasks. For detecting OOD data, we propose a method based on spectral analysis of CNN feature maps. We show that different datasets, representing different imaging modalities and/or different organs of interest, have distinct spectral signatures, which can be used to identify whether or not a test image is similar to the images used to train a model. We show that this approach is far more accurate than OOD detection based on prediction uncertainty. The methods proposed in this paper contribute significantly to improving the accuracy and reliability of CNN-based medical image segmentation models.
翻译:进化神经网络(CNNs) 显示是强大的医学图像分割模型。 在这项研究中, 我们主张多任务学习, 即, 培训一个不同数据集的单一模型, 覆盖多个不同感兴趣的器官和不同的成像模式。 具体地说, 对这些模型进行小型医学图像数据集的培训仍然具有挑战性, 有许多研究推广诸如转移学习等技术。 此外, 这些模型在制作过度自信的预测和在测试时用不同分布( OOOD) 数据来显示无声无声无息的预测。 我们的实验显示, 多任务学习可以超越医学图像分割任务中的学习。 为了检测 OOD 数据, 我们建议了一种基于光谱分析分析的单一模型分析方法, 而不是基于多个不同器官的准确度和不同的成像模式。 我们用不同的数据模型来显示, 是否使用不同的感光谱模型来显示不同的图像, 我们用不同的数据来显示, 是否使用不同的感光谱模型来显示, 是否使用不同的感官测试方法来显示不同的感官。 我们用不同的感光谱模型来显示, 显示不同的感官的感象学是不同的感官。 我们用不同的感官测试方法可以显示不同的数据 显示不同的感光谱模型可以显示不同的感像。