Most of object detection algorithms can be categorized into two classes: two-stage detectors and one-stage detectors. Recently, many efforts have been devoted to one-stage detectors for the simple yet effective architecture. Different from two-stage detectors, one-stage detectors aim to identify foreground objects from all candidates in a single stage. This architecture is efficient but can suffer from the imbalance issue with respect to two aspects: the inter-class imbalance between the number of candidates from foreground and background classes and the intra-class imbalance in the hardness of background candidates, where only a few candidates are hard to be identified. In this work, we propose a novel distributional ranking (DR) loss to handle the challenge. For each image, we convert the classification problem to a ranking problem, which considers pairs of candidates within the image, to address the inter-class imbalance problem. Then, we push the distributions of confidence scores for foreground and background towards the decision boundary. After that, we optimize the rank of the expectations of derived distributions in lieu of original pairs. Our method not only mitigates the intra-class imbalance issue in background candidates but also improves the efficiency for the ranking algorithm. By merely replacing the focal loss in RetinaNet with the developed DR loss and applying ResNet-101 as the backbone, mAP of the single-scale test on COCO can be improved from 39.1% to 41.7% without bells and whistles, which demonstrates the effectiveness of the proposed loss function. Code is available at \url{https://github.com/idstcv/DR_loss}.
翻译:大多数目标检测算法可以分为两类:两阶段探测器和一阶段探测器。最近,许多努力都用于一阶段探测器,用于简单而有效的结构。不同于两阶段探测器,一阶段探测器的目的是在一个阶段中从所有候选人中识别前景对象。这一结构是有效的,但可能会在两个方面受到不平衡问题的影响:前层和背景类别候选人人数之间的阶层间不平衡,以及背景候选人的硬性阶层内部不平衡,其中只有少数候选人难以识别。在这项工作中,我们建议用一个新的分发等级(DR)损失来应对挑战。对于每个图像,我们将分类问题转换为等级问题,在单一阶段中考虑候选人的组合,以解决阶级间不平衡问题。然后,我们将前层和背景类别候选人之间的信任等级分配推向决定界限。之后,我们优化了从衍生分布到原始候选人的等级分布的等级分配的级别,在此情况下,我们的方法不仅减轻了背景候选人的内部分配不平衡问题,而且还提高了在39 AS/DR1 的级别测试效率,只是将READR 的升级为标准。