Over the past few years, deep learning methods have shown remarkable results in many face-related tasks, including automatic facial expression recognition (FER) in-the-wild. Meanwhile, numerous models describing human emotional states have been proposed by the psychology community. However, there is no clear evidence as to which representation is more appropriate, and the majority of FER systems use either the categorical or the dimensional model of affect. Inspired by recent work in multi-label classification, this paper proposes a novel multi-task learning (MTL) framework that exploits the dependencies between these two models using a Graph Convolutional Network (GCN) to recognize facial expressions in-the-wild. Specifically, a shared feature representation is learned for both discrete and continuous recognition in an MTL setting. Moreover, the facial expression classifiers and the valence-arousal regressors are learned through a GCN that explicitly captures the dependencies between them. To evaluate the performance of our method under real-world conditions, we train our models on the AffectNet dataset. Our experimental results show that our method outperforms the current state-of-the-art methods on discrete FER.
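To make the described architecture concrete, below is a minimal sketch of a shared backbone with a GCN head that produces both the expression classifiers and the valence-arousal regressors as node embeddings over a label-dependency graph. This is an illustrative reconstruction under stated assumptions, not the authors' exact implementation: it assumes PyTorch, a ResNet-50 backbone, 7 discrete expression classes, a learnable adjacency matrix (the paper presumably derives dependencies from label statistics), and valence/arousal in [-1, 1]; the names `GCNLayer`, `MTLGCNHead`, and `FERModel` are hypothetical.

```python
# Hedged sketch of an MTL + GCN head for joint discrete/continuous FER.
# Assumptions (not from the abstract): PyTorch, ResNet-50, 7 expression
# classes, learnable adjacency, tanh-bounded valence/arousal outputs.
import torch
import torch.nn as nn
import torch.nn.functional as F
import torchvision.models as models


class GCNLayer(nn.Module):
    """One graph convolution: H' = act(A_hat @ H @ W)."""
    def __init__(self, in_dim, out_dim):
        super().__init__()
        self.weight = nn.Parameter(torch.empty(in_dim, out_dim))
        nn.init.xavier_uniform_(self.weight)

    def forward(self, h, a_hat):
        return F.leaky_relu(a_hat @ h @ self.weight)


class MTLGCNHead(nn.Module):
    """Learns per-output predictors (expression classifiers and
    valence-arousal regressors) as GCN node embeddings over a
    label-dependency graph, so the two affect models share structure."""
    def __init__(self, feat_dim=2048, node_dim=300, n_expr=7, n_va=2):
        super().__init__()
        n_nodes = n_expr + n_va                     # one node per output
        self.node_emb = nn.Parameter(torch.randn(n_nodes, node_dim))
        # Learnable adjacency; a data-driven dependency matrix between
        # categorical and dimensional labels could be plugged in instead.
        self.adj = nn.Parameter(torch.eye(n_nodes))
        self.gcn1 = GCNLayer(node_dim, 1024)
        self.gcn2 = GCNLayer(1024, feat_dim)
        self.n_expr = n_expr

    def forward(self, feats):
        # Symmetrically normalized adjacency: A_hat = D^-1/2 A D^-1/2
        # (self-loops come from the identity initialization of A).
        a = self.adj
        d_inv_sqrt = a.sum(1).clamp(min=1e-6).pow(-0.5)
        a_hat = d_inv_sqrt[:, None] * a * d_inv_sqrt[None, :]
        # GCN output rows act as the weight vectors of each predictor.
        w = self.gcn2(self.gcn1(self.node_emb, a_hat), a_hat)
        out = feats @ w.t()                          # (B, n_nodes)
        expr_logits = out[:, :self.n_expr]           # discrete scores
        va = torch.tanh(out[:, self.n_expr:])        # valence/arousal
        return expr_logits, va


class FERModel(nn.Module):
    """Shared CNN representation feeding the GCN-based MTL head."""
    def __init__(self):
        super().__init__()
        backbone = models.resnet50(weights=None)
        self.backbone = nn.Sequential(*list(backbone.children())[:-1])
        self.head = MTLGCNHead()

    def forward(self, x):
        feats = self.backbone(x).flatten(1)          # shared features
        return self.head(feats)
```

In such a setup, training would combine a classification loss on the expression logits with a regression loss on the valence-arousal outputs (e.g., cross-entropy plus a concordance-based term, a common choice for AffectNet-style labels, though the abstract does not specify the losses); the key design point the sketch illustrates is that both tasks read from one shared representation while the GCN couples their predictors through the dependency graph.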