协调统一输出平衡, 用于极均衡输入数据的语义分割 (Harmonizing Output Imbalance for semantic segmentation on extremely-imbalanced input data)

Semantic segmentation is a high level computer vision task that assigns a label for each pixel of an image. It is challengeful to deal with extremely-imbalanced data in which the ratio of target ixels to background pixels is lower than 1:1000. Such severe input imbalance leads to output imbalance for poor model training. This paper considers three issues for extremely-imbalanced data: inspired by the region based loss, an implicit measure for the output imbalance is proposed, and an adaptive algorithm is designed for guiding the output imbalance hyperparameter selection; then it is generalized to distribution based loss for dealing with output imbalance; and finally a compound loss with our adaptive hyperparameter selection alogorithm can keep the consistency of training and inference for harmonizing the output imbalance. With four popular deep architectures on our private dataset with three input imbalance scales and three public datasets, extensive experiments demonstrate the ompetitive/promising performance of the proposed method.

翻译：语义分割是一个高层次的计算机视野任务, 它为图像的每个像素指定一个标签。要处理极均衡的数据, 其目标像素与背景像素的比例要低于1: 1 000 。如此严重的输入不平衡导致模型训练差的输出不平衡。本文审议了极均衡数据的三个问题: 受基于区域的损失的启发, 提议了产出不平衡的隐含度, 并设计了一个适应性算法来指导产出不平衡的超参数选择; 然后它被普遍地用于处理产出不平衡的分布性损失; 最后, 我们适应性超参数选择的反光度可以保持培训和推论的一致性, 以协调产出不平衡。我们的私人数据集有四个广受欢迎的深层结构, 有三个输入不平衡尺度和三个公共数据集, 广泛的实验展示了拟议方法的ompetivave/保证性表现。