Scarcity of high quality annotated images remains a limiting factor for training accurate image segmentation models. While more and more annotated datasets become publicly available, the number of samples in each individual database is often small. Combining different databases to create larger amounts of training data is appealing yet challenging due to the heterogeneity as a result of differences in data acquisition and annotation processes, often yielding incompatible or even conflicting information. In this paper, we investigate and propose several strategies for learning from partially overlapping labels in the context of abdominal organ segmentation. We find that combining a semi-supervised approach with an adaptive cross entropy loss can successfully exploit heterogeneously annotated data and substantially improve segmentation accuracy compared to baseline and alternative approaches.
翻译:缺乏高质量的附加说明图像仍然是培训准确图像分解模型的一个限制因素。虽然越来越多的附加说明的数据集公开提供,但每个数据库的样本数量往往很少。将不同数据库合并以创造更多培训数据是颇具吸引力的,但由于数据获取和批注过程的差异,往往产生不兼容甚至相互矛盾的信息,因此具有挑战性。在本文件中,我们调查并提出了几项战略,以便从腹部器官分解过程中部分重叠的标签中学习。我们发现,将半监督办法与适应性跨倍增损耗相结合,可以成功地利用多式附加说明数据,并大大提高分解准确度,与基线和替代方法相比。