In this paper we survey and analyze modern neural-network-based facial landmark detection algorithms. We focus on approaches that have led to a significant increase in quality over the past few years on datasets with large pose and emotion variability, high levels of face occlusions - all of which are typical in real-world scenarios. We summarize the improvements into categories, provide quality comparison on difficult and modern in-the-wild datasets: 300-W, AFLW, WFLW, COFW. Additionally, we compare algorithm speed on CPU, GPU and Mobile devices. For completeness, we also briefly touch on established methods with open implementations available. Besides, we cover applications and vulnerabilities of the landmark detection algorithms. Based on which, we raise problems that as we hope will lead to further algorithm improvements.
翻译:在本文中,我们调查和分析现代神经-网络的面部里程碑式检测算法。我们侧重于在过去几年里导致大量提高以下数据组质量的方法:具有巨大面貌和情感变异性的数据集、高程度的面部隔离(在现实世界情景中都是典型的)。我们将改进情况归纳为类别,对困难和现代的网上数据集进行质量比较:300-W、ALFW、WFLW、COFW。此外,我们比较了CPU、GPU和移动设备的算法速度。为了完整起见,我们还简要地介绍了既有方法,并提供了公开的操作。此外,我们涵盖了里程碑式检测算法的应用和脆弱性。在此基础上,我们提出了我们希望能够导致进一步算法改进的问题。