Micro-expression has emerged as a promising modality in affective computing due to its high objectivity in emotion detection. Despite the higher recognition accuracy provided by the deep learning models, there are still significant scope for improvements in micro-expression recognition techniques. The presence of micro-expressions in small-local regions of the face, as well as the limited size of available databases, continue to limit the accuracy in recognizing micro-expressions. In this work, we propose a facial micro-expression recognition model using 3D residual attention network named MERANet to tackle such challenges. The proposed model takes advantage of spatial-temporal attention and channel attention together, to learn deeper fine-grained subtle features for classification of emotions. Further, the proposed model encompasses both spatial and temporal information simultaneously using the 3D kernels and residual connections. Moreover, the channel features and spatio-temporal features are re-calibrated using the channel and spatio-temporal attentions, respectively in each residual module. Our attention mechanism enables the model to learn to focus on different facial areas of interest. The experiments are conducted on benchmark facial micro-expression datasets. A superior performance is observed as compared to the state-of-the-art for facial micro-expression recognition on benchmark data.
翻译:由于情感检测的高度客观性,微表情在感官计算中已成为一种很有希望的模式。尽管深层学习模型提供的准确度较高,但在微表情识别技术方面仍有很大的改进余地。微表情在面部小地区的存在以及现有数据库的有限规模继续限制微表情在承认微表情方面的准确性。在这项工作中,我们提出了一个面部微表情识别模型,使用名为MERANet的3D残余关注网络来应对这些挑战。拟议的模型利用空间时空关注和集中关注,学习更深微微微微微微微的微妙情感分类特征。此外,拟议的模型包含空间和时间信息,同时使用3D内核和残余连接。此外,频道特征和微孔-时空特征利用频道和阵列关注分别在每个残余模块中进行重新校准。我们的关注机制使模型能够学习不同的面部关注领域,从而学习更深细微表情细微的细细微的细微的细微分微分辨特征。还同时使用3D内核内核和剩余连接进行实验。观测到的高级的状态数据识别。