Segmentation has emerged as a fundamental field of computer vision and natural language processing, which assigns a label to every pixel/feature to extract regions of interest from an image/text. To evaluate the performance of segmentation, the Dice and IoU metrics are used to measure the degree of overlap between the ground truth and the predicted segmentation. In this paper, we establish a theoretical foundation of segmentation with respect to the Dice/IoU metrics, including the Bayes rule and Dice/IoU-calibration, analogous to classification-calibration or Fisher consistency in classification. We prove that the existing thresholding-based framework with most operating losses are not consistent with respect to the Dice/IoU metrics, and thus may lead to a suboptimal solution. To address this pitfall, we propose a novel consistent ranking-based framework, namely RankDice/RankIoU, inspired by plug-in rules of the Bayes segmentation rule. Three numerical algorithms with GPU parallel execution are developed to implement the proposed framework in large-scale and high-dimensional segmentation. We study statistical properties of the proposed framework. We show it is Dice-/IoU-calibrated, and its excess risk bounds and the rate of convergence are also provided. The numerical effectiveness of RankDice/mRankDice is demonstrated in various simulated examples and Fine-annotated CityScapes and Pascal VOC datasets with state-of-the-art deep learning architectures.
翻译:作为计算机视觉和自然语言处理的基本领域,出现了一个计算机视觉和自然语言处理的基本领域,它给每个像素/特性贴上标签,从图像/文本中提取感兴趣的区域。为了评估分解的性能,使用Dice和IoU衡量标准来衡量地面真相与预测分解之间的重叠程度。在本文中,我们为Dice/IoU衡量标准,包括Bayes规则和Dice/IoU校准,以及分类中的分类校准和渔业的一致性,建立了一个理论基础。我们证明,现有的基于临界值的框架与大多数运行损失的Dice/IoU衡量标准不相符,因此可能导致亚优性的解决办法。为了解决这一缺陷,我们提出了一个新的、一致的排序框架,即Scranc Dice/RankIoU,在Bayes分解规则的插接规则的启发下。与GPUPU平行执行的三种数字算法,以大规模和高度的分解或高维度的分解框架相近。我们用S-S-Simal-ral-ralal-ral-licalalal-lialalationationalation-Istal-deal-destreval 和Servial-deal-vial-vial-vial-vial-vial-vial-vial-vial-vial-vial-vial-vial-vial-view.我们,我们提供各种的统计学的统计学框架,我们,我们提供了各种的代的统计学的多的统计学标准。我们研究。我们提供了各种的统计学积积积。我们学习和多的统计学框架。