Facial expression recognition is a challenging task when neural network is applied to pattern recognition. Most of the current recognition research is based on single source facial data, which generally has the disadvantages of low accuracy and low robustness. In this paper, a neural network algorithm of facial expression recognition based on multimodal data fusion is proposed. The algorithm is based on the multimodal data, and it takes the facial image, the histogram of oriented gradient of the image and the facial landmarks as the input, and establishes CNN, LNN and HNN three sub neural networks to extract data features, using multimodal data feature fusion mechanism to improve the accuracy of facial expression recognition. Experimental results show that, benefiting by the complementarity of multimodal data, the algorithm has a great improvement in accuracy, robustness and detection speed compared with the traditional facial expression recognition algorithm. Especially in the case of partial occlusion, illumination and head posture transformation, the algorithm also shows a high confidence.
翻译:当神经网络应用于模式识别时,面部表达式识别是一项具有挑战性的任务。当前的识别研究大多以单一源面部数据为基础,通常具有低精度和低强度的缺点。在本文中,提出了基于多式联运数据聚合的面部表达式识别神经网络算法。该算法以多式联运数据为基础,将面部图像、图像定向梯度直观图和面部标志作为输入,并建立了CNN、LNNN和HNNN的三个次神经网络来提取数据特征,利用多式联运数据特征聚合机制提高面部表达式识别的准确性。实验结果表明,由于多式联运数据的互补性,该算法在准确性、稳健性和检测速度方面与传统的面部表达式识别算法相比有很大的改进,特别是在部分封闭、照明和头部姿势转换的情况下,算法也显示出高度的信心。