In this paper, we present our advanced solutions to the two sub-challenges of Affective Behavior Analysis in the wild (ABAW) 2023: the Emotional Reaction Intensity (ERI) Estimation Challenge and Expression (Expr) Classification Challenge. ABAW 2023 aims to tackle the challenge of affective behavior analysis in natural contexts, with the ultimate goal of creating intelligent machines and robots that possess the ability to comprehend human emotions, feelings, and behaviors. For the Expression Classification Challenge, we propose a streamlined approach that handles the challenges of classification effectively. However, our main contribution lies in our use of diverse models and tools to extract multimodal features such as audio and video cues from the Hume-Reaction dataset. By studying, analyzing, and combining these features, we significantly enhance the model's accuracy for sentiment prediction in a multimodal context. Furthermore, our method achieves outstanding results on the Emotional Reaction Intensity (ERI) Estimation Challenge, surpassing the baseline method by an impressive 84\% increase, as measured by the Pearson Coefficient, on the validation dataset.
翻译:在本文中,我们提出了针对《野外情感行为分析(ABAW)2023》两个子挑战(情感反应强度(ERI)估计挑战及表情(Expr)分类挑战)的先进解决方案。ABAW 2023 旨在解决自然环境下的情感行为分析挑战,最终目标是创建具备理解人类情感、感受和行为能力的智能机器和机器人。对于表情分类挑战,我们提出了一种简洁的方法,能够有效应对分类中的挑战。然而,我们的主要贡献在于利用多种模型和工具从Hume-Reaction数据集中提取多模态特征,如音频和视频信号。通过研究、分析和组合这些特征,我们在多模态情感预测方面显著提高了模型的准确性。此外,我们的方法在情感反应强度估计挑战中取得了出色的成绩,在验证数据集上,以皮尔逊系数衡量,相较于基线方法,提高了84%。