Facial Expression Recognition (FER) is a classification task that points to face variants. Hence, there are certain affinity features between facial expressions, receiving little attention in the FER literature. Convolution padding, despite helping capture the edge information, causes erosion of the feature map simultaneously. After multi-layer filling convolution, the output feature map named albino feature definitely weakens the representation of the expression. To tackle these challenges, we propose a novel architecture named Amending Representation Module (ARM). ARM is a substitute for the pooling layer. Theoretically, it can be embedded in the back end of any network to deal with the Padding Erosion. ARM efficiently enhances facial expression representation from two different directions: 1) reducing the weight of eroded features to offset the side effect of padding, and 2) sharing affinity features over mini-batch to strengthen the representation learning. Experiments on public benchmarks prove that our ARM boosts the performance of FER remarkably. The validation accuracies are respectively 92.05% on RAF-DB, 65.2% on Affect-Net, and 58.71% on SFEW, exceeding current state-of-the-art methods. Our implementation and trained models are available at https://github.com/JiaweiShiCV/Amend-Representation-Module.
翻译:显性表达度识别( FER) 是一个分类任务, 指向变量 。 因此, 面部表达形式之间有一些相似性, 在 FER 文献中很少引起注意 。 革命悬浮, 尽管帮助捕捉边缘信息, 却同时导致地貌图的侵蚀 。 在多层填充卷变后, 名为 albino 的输出特征地图肯定会削弱表达方式的表达方式 。 为了应对这些挑战, 我们提议了一个名为 Amending 代表模块( ARM) 的新结构 。 ARM 是集合层的替代品。 从理论上讲, 它可以嵌入任何网络的后端, 以便与帕丁· 埃罗斯ion 打交道。 ARM 有效地加强了两个不同方向的面貌表现方式:1) 降低被侵蚀的特征的重量, 以抵消垫面效应的副作用 。 2) 以小型包连接方式共享亲近性特征来强化表达方式的表达方式 。 公共基准实验证明我们的ARM- DB 的验证范围分别是92. 05%, Amt- Net 和 Afect- Net 和 58. 7.1- 模型已经超过 MA- fas- fas- sma- spal- 。