R-CNN:争取平衡学习以探测物体 (Libra R-CNN: Towards Balanced Learning for Object Detection)

Compared with model architectures, the training process, which is also crucial to the success of detectors, has received relatively less attention in object detection. In this work, we carefully revisit the standard training practice of detectors, and find that the detection performance is often limited by the imbalance during the training process, which generally consists in three levels - sample level, feature level, and objective level. To mitigate the adverse effects caused thereby, we propose Libra R-CNN, a simple but effective framework towards balanced learning for object detection. It integrates three novel components: IoU-balanced sampling, balanced feature pyramid, and balanced L1 loss, respectively for reducing the imbalance at sample, feature, and objective level. Benefitted from the overall balanced design, Libra R-CNN significantly improves the detection performance. Without bells and whistles, it achieves 2.5 points and 2.0 points higher Average Precision (AP) than FPN Faster R-CNN and RetinaNet respectively on MSCOCO.

翻译：与模型结构相比,对探测器成功也至关重要的培训过程在物体探测方面受到的关注相对较少,在这项工作中,我们仔细重新审视探测器的标准培训做法,发现探测性能往往受到培训过程不平衡的限制,培训过程一般分为三个层次:抽样水平、特征水平和客观水平;为减轻由此造成的有害影响,我们提议利布拉 R-CNN,这是实现物体探测平衡学习的一个简单而有效的框架;它包括三个新颖的组成部分:IoU平衡抽样、平衡的地貌金字塔和平衡的L1损失,分别用于减少抽样、特征和客观水平的不平衡;从总体平衡设计中受益的Libra R-CNN显著改进了探测性能;没有钟和哨子,它平均精度分别达到2.5分和2.0分高于FPN更快的R-CNN和RetinaNet。

相关内容

R-CNN

关注 26

R-CNN的全称是Region-CNN，它可以说是是第一个成功将深度学习应用到目标检测上的算法。传统的目标检测方法大多以图像识别为基础。一般可以在图片上使用穷举法选出所所有物体可能出现的区域框，对这些区域框提取特征并使用图像识别方法分类，得到所有分类成功的区域后,通过非极大值抑制(Non-maximumsuppression)输出结果。

近期必读的六篇计算机视觉顶会ECCV 2020【目标检测】相关论文

专知会员服务

59+阅读 · 2020年7月7日

【旷视-CVPR2020】领域自适应对象检测的探索类别正则化，Exploring Categorical Regularization for Domain Adaptive Object Detection

专知会员服务

38+阅读 · 2020年3月23日

【AAAI2020】Context-Transformer:上下文转换器:解决对象混淆的小样本检测，Context-Transformer: Tackling Object Confusion for Few-Shot Detection

专知会员服务

51+阅读 · 2020年3月17日

【TPAMI2020】目标检测中的不平衡问题:综述论文，34页pdf

专知会员服务

55+阅读 · 2020年3月16日