Collective insights from a group of experts have always proven to outperform an individual's best diagnostic for clinical tasks. For the task of medical image segmentation, existing research on AI-based alternatives focuses more on developing models that can imitate the best individual rather than harnessing the power of expert groups. In this paper, we introduce a single diffusion model-based approach that produces multiple plausible outputs by learning a distribution over group insights. Our proposed model generates a distribution of segmentation masks by leveraging the inherent stochastic sampling process of diffusion using only minimal additional learning. We demonstrate on three different medical image modalities- CT, ultrasound, and MRI that our model is capable of producing several possible variants while capturing the frequencies of their occurrences. Comprehensive results show that our proposed approach outperforms existing state-of-the-art ambiguous segmentation networks in terms of accuracy while preserving naturally occurring variation. We also propose a new metric to evaluate the diversity as well as the accuracy of segmentation predictions that aligns with the interest of clinical practice of collective insights.
翻译:在临床任务中,来自一组专家的集体洞察力总是能够胜过单个人的最佳诊断。对于医学图像分割的任务,现有的基于人工智能的替代方法更多地关注于开发可以模仿最佳个人的模型,而不是利用专家群体的力量。本文介绍了一种基于单个扩散模型的方法,通过学习群体洞察力的分布,生成多种可能的输出。我们提出的模型利用了扩散的固有随机采样过程,仅使用最少的额外学习就能生成分割掩模的分布。我们演示了在三种不同的医学图像模态 - CT,超声和MRI上,我们的模型能够产生多个可能的变体,同时捕捉它们发生频率的情况。综合结果表明,我们提出的方法在精度方面优于现有的最先进的模糊分割网络,同时保留了自然发生的变化。我们还提出了一种新的度量标准,用于评估分割预测的多样性和准确性,与集体洞察力的临床实践相关。