While Convolutional Neural Networks (CNNs) have significantly boosted the performance of face related algorithms, maintaining accuracy and efficiency simultaneously in practical use remains challenging. The state-of-the-art methods employ deeper networks for better performance, which makes it less practical for mobile applications because of more parameters and higher computational complexity. Therefore, we propose an efficient multitask neural network, Alignment & Tracking & Pose Network (ATPN) for face alignment, face tracking and head pose estimation. Specifically, to achieve better performance with fewer layers for face alignment, we introduce a shortcut connection between shallow-layer and deep-layer features. We find the shallow-layer features are highly correspond to facial boundaries that can provide the structural information of face and it is crucial for face alignment. Moreover, we generate a cheap heatmap based on the face alignment result and fuse it with features to improve the performance of the other two tasks. Based on the heatmap, the network can utilize both geometric information of landmarks and appearance information for head pose estimation. The heatmap also provides attention clues for face tracking. The face tracking task also saves us the face detection procedure for each frame, which also significantly boost the real-time capability for video-based tasks. We experimentally validate ATPN on four benchmark datasets, WFLW, 300VW, WIDER Face and 300W-LP. The experimental results demonstrate that it achieves better performance with much less parameters and lower computational complexity compared to other light models.
翻译:虽然进化神经网络(CNNs)极大地提升了面部相关算法的性能,但实际使用时保持准确性和效率仍然具有挑战性。最先进的方法采用更深的网络来提高性能,因为参数更多,计算复杂程度更高,使得移动应用更不那么实用。因此,我们提议建立一个高效的多任务神经网络,对面调整、对面跟踪和跟踪和波斯网络(ATPN),以进行面部调整、对面跟踪和头部估计。具体地说,为了用较少的层进行更佳的性能,我们引入了浅层和深层特征之间的捷径连接。我们发现浅层的参数高度符合面部界限,能够提供面部结构信息,对面部调整至关重要。此外,我们根据面对齐结果制作了廉价的热图,将其与改善其他两项任务的性能结合起来。根据热映图,网络可以利用300个地标和外观信息的几何信息来做出估计。热图还提供了面对面跟踪的线索。我们发现面部参数也非常符合面线的界限,我们进行更低的面跟踪任务,并且测量了每个实验性数据测试,我们还要展示了每个测试的四框架。