Two-stage detectors are state-of-the-art in object detection as well as pedestrian detection. However, the current two-stage detectors are inefficient as they do bounding box regression in multiple steps i.e. in region proposal networks and bounding box heads. Also, the anchor-based region proposal networks are computationally expensive to train. We propose F2DNet, a novel two-stage detection architecture which eliminates redundancy of current two-stage detectors by replacing the region proposal network with our focal detection network and bounding box head with our fast suppression head. We benchmark F2DNet on top pedestrian detection datasets, thoroughly compare it against the existing state-of-the-art detectors and conduct cross dataset evaluation to test the generalizability of our model to unseen data. Our F2DNet achieves 8.7\%, 2.2\%, and 6.1\% MR-2 on City Persons, Caltech Pedestrian, and Euro City Person datasets respectively when trained on a single dataset and reaches 20.4\% and 26.2\% MR-2 in heavy occlusion setting of Caltech Pedestrian and City Persons datasets when using progressive fine-tunning. Furthermore, F2DNet have significantly lesser inference time compared to the current state-of-the-art. Code and trained models will be available at https://github.com/AbdulHannanKhan/F2DNet.
翻译:在物体探测和行人探测方面,两阶段探测器是最新的两阶段探测器,但目前的两阶段探测器效率不高,因为它们在多个步骤(即区域建议网络和捆绑箱头)中进行捆绑盒回归。此外,基于锚基区域投标书网络的计算成本很高。我们提出F2DNet,这是一个新型的两阶段探测结构,它通过用我们的焦点探测网络取代区域建议网络,用我们的快速抑制头来捆绑箱头来消除现有两阶段探测器的冗余。我们在高层行人探测数据集上以F2DNet为基准,与现有的最新探测器进行彻底比较,并进行交叉数据集评估,以测试我们模型对隐蔽数据的一般可操作性。我们的F2D网络在进行单一数据集培训并达到20.4 ⁇ 和26.2 ⁇ MMM-2时,分别实现了8.7 ⁇ 、2.2 ⁇ 和6.1 ⁇ MR-2关于城市人员、Caltech Pedestrian、Caltech Pedstrian/CRM2-M-2, 将使用进步的FND 模型进行较低的测试。