Facial Expression Recognition (FER) is crucial in many research domains because it enables machines to better understand human behaviours. FER methods face the problems of relatively small datasets and noisy data that don't allow classical networks to generalize well. To alleviate these issues, we guide the model to concentrate on specific facial areas like the eyes, the mouth or the eyebrows, which we argue are decisive to recognise facial expressions. We propose the Privileged Attribution Loss (PAL), a method that directs the attention of the model towards the most salient facial regions by encouraging its attribution maps to correspond to a heatmap formed by facial landmarks. Furthermore, we introduce several channel strategies that allow the model to have more degrees of freedom. The proposed method is independent of the backbone architecture and doesn't need additional semantic information at test time. Finally, experimental results show that the proposed PAL method outperforms current state-of-the-art methods on both RAF-DB and AffectNet.
翻译:为了缓解这些问题,我们引导模型集中关注眼睛、嘴或眉眉等特定面部区域,我们认为这些面部表情是识别面部表达方式的决定性因素。我们建议采用原始归因损失(PAL)方法,通过鼓励其属性图与面部标志形成的热映射相对应,将模型的注意力引向最突出的面部区域。此外,我们引入了几个使模型具有更大自由度的频道战略。拟议方法独立于主干结构,在测试时不需要额外的语义信息。最后,实验结果显示,拟议的PAL方法优于RAF-DB和AfffectNet目前的最新方法。