Although tremendous strides have been made in face detection, one of the remaining open challenges is to achieve real-time speed on the CPU as well as maintain high performance, since effective models for face detection tend to be computationally prohibitive. To address this challenge, we propose a novel face detector, named FaceBoxes, with superior performance on both speed and accuracy. Specifically, our method has a lightweight yet powerful network structure that consists of the Rapidly Digested Convolutional Layers (RDCL) and the Multiple Scale Convolutional Layers (MSCL). The RDCL is designed to enable FaceBoxes to achieve real-time speed on the CPU. The MSCL aims at enriching the receptive fields and discretizing anchors over different layers to handle faces of various scales. Besides, we propose a new anchor densification strategy to make different types of anchors have the same density on the image, which significantly improves the recall rate of small faces. As a consequence, the proposed detector runs at 20 FPS on a single CPU core and 125 FPS using a GPU for VGA-resolution images. Moreover, the speed of FaceBoxes is invariant to the number of faces. We comprehensively evaluate this method and present state-of-the-art detection performance on several face detection benchmark datasets, including the AFW, PASCAL face, and FDDB.
翻译:尽管在面对探测方面已经取得了巨大的进步,但仍然存在的公开挑战之一是在CPU上实现实时速度,并保持高性能,因为有效的面对面探测模型往往在计算上令人望而却步。为了应对这一挑战,我们提出了名为FaceBoxes的新的脸色探测器,其速度和准确性都表现优异。具体地说,我们的方法有一个轻量而强大的网络结构,由快速增长的革命层(RDCL)和多规模的革命层(MSCL)组成。RDCL的设计是使FaceBoxes能够在CPU上实现实时速度。MSL旨在丰富可接受的场和不同层的离散锚,以便处理不同尺寸的面部。此外,我们提出了一个新的锁定战略,使不同类型的锚在图像上具有同样的密度,从而大大提高了小面部的回溯率。因此,拟议的探测器在单一CPU核心上运行20 FPS,使用GPU在VGGA分辨率图像上运行125 FPSS。此外,FaceBxx的快速度和当前检测方法,包括我们目前对PSAS的状态进行了评估。