Dense facial landmark detection is one of the key elements of face processing pipeline. It is used in virtual face reenactment, emotion recognition, driver status tracking, etc. Early approaches were suitable for facial landmark detection in controlled environments only, which is clearly insufficient. Neural networks have shown an astonishing qualitative improvement for in-the-wild face landmark detection problem, and are now being studied by many researchers in the field. Numerous bright ideas are proposed, often complimentary to each other. However, exploration of the whole volume of novel approaches is quite challenging. Therefore, we present this survey, where we summarize state-of-the-art algorithms into categories, provide a comparison of recently introduced in-the-wild datasets (e.g., 300W, AFLW, COFW, WFLW) that contain images with large pose, face occlusion, taken in unconstrained conditions. In addition to quality, applications require fast inference, and preferably on mobile devices. Hence, we include information about algorithm inference speed both on desktop and mobile hardware, which is rarely studied. Importantly, we highlight problems of algorithms, their applications, vulnerabilities, and briefly touch on established methods. We hope that the reader will find many novel ideas, will see how the algorithms are used in applications, which will enable further research.
翻译:高面部标志性检测是面部处理管道的关键要素之一。 它用于虚拟面部重新反应、情绪识别、驱动器状态跟踪等。 早期方法显然不足以在受控制的环境中进行面部标志性检测,这显然是不够的。 神经网络显示,对眼部显眼的标志性检测问题,在质量上取得了惊人的改进,目前许多实地研究人员正在研究。 提出了许多明亮的想法,而且往往互为补充。 但是,探索全量的新办法相当具有挑战性。 因此,我们在此调查中,我们将最新工艺的算法归纳为类别,对最近引入的电动数据集(例如,300W、ALFW、COFW、WFLW)进行了比较,这些数据集含有大面部、面部隔离、在不受限制的条件下拍摄的图像。 除了质量外,应用程序需要快速推导,而且最好是移动设备。 因此,我们把关于桌面和移动硬件的算法速度的信息都包含在很少研究的类别中, 我们强调最近引入的算法问题, 其应用将如何让读者了解它们使用新的方法。