We study two of the most popular performance metrics in medical image segmentation, Accuracy and Dice, when the target labels are noisy. For both metrics, several statements related to characterization and volume properties of the set of optimal segmentations are proved, and associated experiments are provided. Our main insights are: (i) the volume of the solutions to both metrics may deviate significantly from the expected volume of the target, (ii) the volume of a solution to Accuracy is always less than or equal to the volume of a solution to Dice and (iii) the optimal solutions to both of these metrics coincide when the set of feasible segmentations is constrained to the set of segmentations with the volume equal to the expected volume of the target.
翻译:摘要:本文研究医学图像分割中两种最常用的性能指标Accuracy和Dice在目标标签存在噪声时的情况。对于这两个指标,我们证明了几个相关的性质,以及提供了相应的实验。我们的主要结果是:(i)在两个指标的解集中,解集的体积可能与目标期望体积显著偏离,(ii)Accuracy的解集体积总是小于或等于Dice的解集体积,(iii)当可行的分割集合被限制为体积等于目标期望体积的分割集合时,这两个指标的最优解是相同的。