Deep facial expression recognition faces two challenges that both stem from the large number of trainable parameters: long training times and a lack of interpretability. We propose a novel method based on evolutionary algorithms, that deals with both challenges by massively reducing the number of trainable parameters, whilst simultaneously retaining classification performance, and in some cases achieving superior performance. We are robustly able to reduce the number of parameters on average by 95% (e.g. from 2M to 100k parameters) with no loss in classification accuracy. The algorithm learns to choose small patches from the image, relative to the nose, which carry the most important information about emotion, and which coincide with typical human choices of important features. Our work implements a novel form attention and shows that evolutionary algorithms are a valuable addition to machine learning in the deep learning era, both for reducing the number of parameters for facial expression recognition and for providing interpretable features that can help reduce bias.
翻译:深面面部识别面临两个挑战,这两个挑战都来自大量可训练参数:长时间的训练时间和缺乏可解释性。我们提出了基于进化算法的新颖方法,该方法通过大量减少可训练参数的数量来应对这两个挑战,同时保留分类性能,并在某些情况下实现优异性能。我们完全能够将参数数量平均减少95%(例如从2M到100k参数),分类准确性没有损失。算法学会从图像中选择小片片片段,相对于鼻子而言,它包含关于情感的最重要信息,与人类对重要特征的典型选择相吻合。我们的工作采用了一种新式的关注,并表明进化算法是对深层次学习时代机器学习的一种宝贵补充,既可以减少面部表达识别参数的数量,也可以提供有助于减少偏见的可解释特征。