Micro-expression recognition (MER) is valuable because the involuntary nature of micro-expressions (MEs) can reveal genuine emotions. Most works recognize MEs by taking RGB videos or images as input. In fact, the activated facial regions in ME images are very small and the subtle motion can be easily submerged in the unrelated information. Facial landmarks are a low-dimensional and compact modality, which leads to much lower computational cost and can potentially concentrate more on ME-related features. However, the discriminability of landmarks for MER is not clear. Thus, this paper explores the contribution of facial landmarks and constructs a new framework to efficiently recognize MEs with sole facial landmark information. Specially, we design a separate structure module to separately aggregate the spatial and temporal information in the geometric movement graph based on facial landmarks, and a Geometric Two-Stream Graph Network is constructed to aggregate the low-order geometric information and high-order semantic information of facial landmarks. Furthermore, two core components are proposed to enhance features. Specifically, a semantic adjacency matrix can automatically model the relationship between nodes even long-distance nodes in a self-learning fashion; and an Adaptive Action Unit loss is introduced to guide the learning process such that the learned features are forced to have a synchronized pattern with facial action units. Notably, this work tackles MER only utilizing geometric features, processed based on a graph model, which provides a new idea with much higher efficiency to promote MER. The experimental results demonstrate that the proposed method can achieve competitive or even superior performance with a significantly reduced computational cost, and facial landmarks can significantly contribute to MER and are worth further study for efficient ME analysis.
翻译:微表层识别(MER)是有价值的,因为微表层表现(MEs)的非自愿性质可以揭示真实的情感。 大部分工作都通过将 RGB 视频或图像作为输入来识别MEs。 事实上, ME 图像中活跃的面部区域非常小, 微妙的动作很容易被不相干的信息淹没在不相干的信息中。 地貌标志是一种低维和紧凑的方式, 导致计算成本低得多, 并可能更集中于与ME有关的特征。 但是, 市面标志的可偏差性并不十分清楚。 因此, 本文探讨了面部标志的可辨别作用, 并构建了一个新的框架, 以只使用面部标志信息来有效识别 ME 。 特别是, 我们设计了一个单独的结构模式, 以基于面部标志的几何运动图中的时间和时间信息分别汇总。 地理图的两面色图网络旨在汇总低排序信息, 和面部标志的建模模型只能用来加强特性。 具体来说, 语系比更高级的矩阵矩阵矩阵矩阵可以自动模拟地标度模型模型可以模拟, 。 和直径面层面层平面部模型可以显著地段段路面图分析, 。