While general object detection with deep learning has achieved great success in the past few years, the performance and efficiency of detecting small objects are far from satisfactory. The most common and effective way to promote small object detection is to use high-resolution images or feature maps. However, both approaches induce costly computation since the computational cost grows squarely as the size of images and features increases. To get the best of two worlds, we propose QueryDet that uses a novel query mechanism to accelerate the inference speed of feature-pyramid based object detectors. The pipeline composes two steps: it first predicts the coarse locations of small objects on low-resolution features and then computes the accurate detection results using high-resolution features sparsely guided by those coarse positions. In this way, we can not only harvest the benefit of high-resolution feature maps but also avoid useless computation for the background area. On the popular COCO dataset, the proposed method improves the detection mAP by 1.0 and mAP-small by 2.0, and the high-resolution inference speed is improved to 3.0x on average. On VisDrone dataset, which contains more small objects, we create a new state-of-the-art while gaining a 2.3x high-resolution acceleration on average. Code is available at: https://github.com/ChenhongyiYang/QueryDet-PyTorch
翻译:虽然经过深层学习的普通物体探测在过去几年中取得了巨大成功,但探测小物体的性能和效率远不尽人意。促进小物体探测的最常见有效方法是使用高分辨率图像或地貌图。然而,两种方法都会引起昂贵的计算,因为计算成本随着图像和地貌的大小增加而急剧增加。为了获得两个世界的最佳结果,我们建议QuryDet使用一种新颖的查询机制来加速基于地谱极光仪的物体探测器的推断速度。管道包含两个步骤:它首先预测低分辨率特征的小物体的粗略位置,然后利用这些粗度位置的低分辨率特征低微地计算准确的探测结果。这样,我们不仅可以收获高分辨率地貌图的收益,而且还可以避免对背景区域进行无效的计算。在流行的COCOCO数据集中,拟议的方法将探测 mAP 改进1.0 和 mAP 小于2.0,而高分辨率推断速度则提高到平均3.0x。在VisDrone Q上,我们创建了高分辨率-2.3-y-y-dex 数据,在高分辨率-degal-deal-de-de-de-debas-de-comcet上我们正在获取了一个新的分辨率。