Facial landmark localization aims to detect the predefined points of human faces, and the topic has been rapidly improved with the recent development of neural network based methods. However, it remains a challenging task when dealing with faces in unconstrained scenarios, especially with large pose variations. In this paper, we target the problem of facial landmark localization across large poses and address this task based on a split-and-aggregate strategy. To split the search space, we propose a set of anchor templates as references for regression, which well addresses the large variations of face poses. Based on the prediction of each anchor template, we propose to aggregate the results, which can reduce the landmark uncertainty due to the large poses. Overall, our proposed approach, named AnchorFace, obtains state-of-the-art results with extremely efficient inference speed on four challenging benchmarks, i.e. AFLW, 300W, Menpo, and WFLW dataset. Code will be released for reproduction.
翻译:面部里程碑式的本地化旨在探测人类面孔的预定点,随着神经网络方法的最近发展,这个专题已迅速改进。然而,在以不受限制的情景处理面孔时,这仍然是一项艰巨的任务,特别是造成巨大变异的情况。在本文件中,我们针对面部里程碑式本地化问题,在大面孔上定位的问题,以分裂和聚合的战略为基础,处理这项任务。为了将搜索空间分开,我们提议一套锚样板作为回归参考,很好地解决面孔的巨大变异。根据对每个锚模版的预测,我们提议将结果汇总起来,从而减少因大面孔造成的里程碑式不确定性。总体而言,我们提议的名为AnchorFace的方法,以极高效的推导速度,在四种具有挑战性的基准上,即ALFW、300W、Menpo和WLFW数据集上,以极高效的推导速获得最新结果。代码将发布供复制。