Although tremendous strides have been made in face detection, one of the remaining open challenges is to achieve real-time speed on the CPU as well as maintain high performance, since effective models for face detection tend to be computationally prohibitive. To address this challenge, we propose a novel face detector, named FaceBoxes, with superior performance on both speed and accuracy. Specifically, our method has a lightweight yet powerful network structure that consists of the Rapidly Digested Convolutional Layers (RDCL) and the Multiple Scale Convolutional Layers (MSCL). The RDCL is designed to enable FaceBoxes to achieve real-time speed on the CPU. The MSCL aims at enriching the receptive fields and discretizing anchors over different layers to handle faces of various scales. Besides, we propose a new anchor densification strategy to make different types of anchors have the same density on the image, which significantly improves the recall rate of small faces. As a consequence, the proposed detector runs at 20 FPS on a single CPU core and 125 FPS using a GPU for VGA-resolution images. Moreover, the speed of FaceBoxes is invariant to the number of faces. We comprehensively evaluate this method and present state-of-the-art detection performance on several face detection benchmark datasets, including the AFW, PASCAL face, and FDDB. Code is available at https://github.com/sfzhang15/FaceBoxes
翻译:尽管在面对探测方面已经取得了巨大的进步,但仍然存在的公开挑战之一是在CPU上实现实时速度,并保持高性能,因为有效的面对面检测模型往往在计算上令人望而却步。为了应对这一挑战,我们提出了名为FaceBoxes的新的脸色探测器,其速度和精确性能均优于FaceBoxes。具体地说,我们的方法有一个轻量但强大的网络结构,由快速增长的革命层(RDCL)和多规模的革命层(MSCL)组成。RDCL的设计是使FaceBoxes能够在CPU上实现实时速度。MSL的目的是为了在不同的层次上丰富可容纳的字段和拆散锚,以便处理不同比例的面部。此外,我们提出了一个新的锁定定位战略,使不同种类的锚在图像上具有同样的密度,从而大大提高了小面部的回溯率。因此,拟议的探测器在单一的CPU15 和125 FPPSFS, 使用GPU(VGA-L) 图像的实时速度。此外,FaceBxes 的探测速度是目前, 和AFALTSBSBSBSD的状态。