Region anchors are the cornerstone of modern object detection techniques. State-of-the-art detectors mostly rely on a dense anchoring scheme, where anchors are sampled uniformly over the spatial domain with a predefined set of scales and aspect ratios. In this paper, we revisit this foundational stage. Our study shows that it can be done much more effectively and efficiently. Specifically, we present an alternative scheme, named Guided Anchoring, which leverages semantic features to guide the anchoring. The proposed method jointly predicts the locations where the center of objects of interest are likely to exist as well as the scales and aspect ratios at different locations. On top of predicted anchor shapes, we mitigate the feature inconsistency with a feature adaption module. We also study the use of high-quality proposals to improve detection performance. The anchoring scheme can be seamlessly integrated to proposal methods and detectors. With Guided Anchoring, we achieve $9.1\%$ higher recall on MS COCO with $90\%$ fewer anchors than the RPN baseline. We also adopt Guided Anchoring in Fast R-CNN, Faster R-CNN and RetinaNet, respectively improving the detection mAP by $2.2\%$, $2.7\%$ and $1.2\%$.
翻译:区域锚是现代天体探测技术的基石。 最新水平的探测器主要依靠密集锚定机制,在空间域上对锚定进行统一取样,并有一套预先确定的尺度和方位比率。 在本文中,我们重新审视这个基础阶段。 我们的研究显示,可以更有效益和效率地进行这项工作。 具体地说,我们提出了一个替代方案,名为“制导Anchoring”,它利用语义特征来引导锚定; 拟议的方法共同预测了可能存在对象中心的地点以及不同地点的标定比例和方位比率。 在预测的锚定形状上,我们用一个功能调整模块来减少特征不一致之处。 我们还研究如何使用高质量的建议来改进探测性能。 锚定方案可以顺利地整合到建议的方法和探测器中。 在“制导”中,我们用比 RPN 基线少90 美元的锚点来回顾 MS CO 。 我们还在快速的R-CN、更快的R-CN$N 和Retina Net中采用了方向An,分别用1.27美元和12美元来改进探测器。