Semantic segmentation is a high level computer vision task that assigns a label for each pixel of an image. It is challenging to deal with extremely-imbalanced data in which the ratio of target pixels to background pixels is lower than 1:1000. Such severe input imbalance leads to output imbalance for poor model training. This paper considers three issues for extremely-imbalanced data: inspired by the region-based Dice loss, an implicit measure for the output imbalance is proposed, and an adaptive algorithm is designed for guiding the output imbalance hyperparameter selection; then it is generalized to distribution-based loss for dealing with output imbalance; and finally a compound loss with our adaptive hyperparameter selection algorithm can keep the consistency of training and inference for harmonizing the output imbalance. With four popular deep architectures on our private dataset from three different input imbalance scales and three public datasets, extensive experiments demonstrate the competitive/promising performance of the proposed method.
翻译:语义分解是一项高层次的计算机愿景任务,为图像的每个像素指定一个标签。 处理极端均衡的数据非常困难, 目标像素与背景像素之比低于1: 11000。 这种严重的输入不平衡导致模型培训不力的产出不平衡。 本文审议了极不平衡数据的三个问题: 由基于区域的 Dice 损失引起的, 提出了产出不平衡的隐含计量, 并设计了一个适应性算法, 用于指导产出不平衡超参数的选择; 然后, 它被普遍化为基于分配的损失, 用于处理产出不平衡; 最后, 我们适应性超参数选择算法的复合损失可以保持培训和推论的一致性, 以协调产出不平衡。 由三个不同的输入不平衡尺度和三个公共数据集构成的四种受欢迎的深层结构, 广泛的实验显示了拟议方法的竞争/ 前景。