Fully convolutional neural network (FCN) has been dominating the game of face detection task for a few years with its congenital capability of sliding-window-searching with shared kernels, which boiled down all the redundant calculation, and most recent state-of-the-art methods such as Faster-RCNN, SSD, YOLO and FPN use FCN as their backbone. So here comes one question: Can we find a universal strategy to further accelerate FCN with higher accuracy, so could accelerate all the recent FCN-based methods? To analyze this, we decompose the face searching space into two orthogonal directions, `scale' and `spatial'. Only a few coordinates in the space expanded by the two base vectors indicate foreground. So if FCN could ignore most of the other points, the searching space and false alarm should be significantly boiled down. Based on this philosophy, a novel method named scale estimation and spatial attention proposal ($S^2AP$) is proposed to pay attention to some specific scales and valid locations in the image pyramid. Furthermore, we adopt a masked-convolution operation based on the attention result to accelerate FCN calculation. Experiments show that FCN-based method RPN can be accelerated by about $4\times$ with the help of $S^2AP$ and masked-FCN and at the same time it can also achieve the state-of-the-art on FDDB, AFW and MALF face detection benchmarks as well.
翻译:完全进化的神经网络(FCN)多年来一直主导着面部探测任务游戏。 几年来,我们以先天式的滑窗搜索能力,先天式的滑窗搜索能力,以共享的内核为方向,将所有多余的计算结果和最新的最先进方法,如Affering-RCNN、SSD、YOLO和FPN,将FCN作为其主干线。因此,这里有一个问题:我们能否找到一种通用战略,以更精确的方式进一步加速FCN,这样就可以加快所有最近的FCN基准?为了分析这一点,我们将面部搜索空间分解成两个或两个或两个方向,即“规模”和“空间”。只有两个基矢量扩展的空间中的一些坐标表明地面。因此,如果FCN能够忽略大多数其他点,搜索空间和错误的警报应该被大大压缩。基于这一理念,一种名为“比例估计”和“空间关注”建议的新方法,目的是关注某些特定的尺度和CN$CN的图像金字型。 此外,我们还可以在FAR的加速模型上显示一个快速的模型计算结果。