Diversity of the features extracted by deep neural networks is important for enhancing the model generalization ability and accordingly its performance in different learning tasks. Facial expression recognition in the wild has attracted interest in recent years due to the challenges existing in this area for extracting discriminative and informative features from occluded images in real-world scenarios. In this paper, we propose a mechanism to diversify the features extracted by CNN layers of state-of-the-art facial expression recognition architectures for enhancing the model capacity in learning discriminative features. To evaluate the effectiveness of the proposed approach, we incorporate this mechanism in two state-of-the-art models to (i) diversify local/global features in an attention-based model and (ii) diversify features extracted by different learners in an ensemble-based model. Experimental results on three well-known facial expression recognition in-the-wild datasets, AffectNet, FER+, and RAF-DB, show the effectiveness of our method, achieving the state-of-the-art performance of 89.99% on RAF-DB, 89.34% on FER+ and the competitive accuracy of 60.02% on AffectNet dataset.
翻译:深层神经网络所提取的特征多样性对于提高模型概括能力并因此提高不同学习任务的业绩十分重要。近年来,由于这一领域在从现实世界情景中隐蔽图像中提取歧视性和信息性特征方面存在挑战,野生的表面表现承认近年来引起了人们的兴趣。在本文件中,我们建议了一种机制,使CNN的面部表现识别层所提取的特征多样化,以提高学习歧视特征的模型能力。为了评价拟议方法的有效性,我们将这一机制纳入两个最先进的模型,以便(一) 使关注模型中的地方/全球特征多样化,(二) 使不同学习者在基于共性模型中提取的特征多样化。关于电离层数据集、AffectNet、FER+和RAF-DB三种著名面部面部的面部表达识别层的实验结果,显示了我们方法的有效性,实现了RAF-DB89.99%、FER+89.34 % 和A60.02% 的网络数据竞争准确性。