In our multicultural world, affect-aware AI systems that support humans must be able to perceive affect despite cultural variation in how emotions are expressed. These systems must also perform well in cultural contexts that lack annotated affect datasets for training models. A standard assumption in affective computing is that affect recognition models trained and used within the same culture (intracultural) will outperform models trained on one culture and applied to different cultures (intercultural). We test this assumption and present the first systematic study of intercultural affect recognition models, using videos of real-world dyadic interactions from six cultures. We develop an attention-based feature selection approach, grounded in temporal causal discovery, to identify behavioral cues that intercultural affect recognition models can leverage. Across all six cultures, our findings demonstrate that intercultural affect recognition models were as effective as or more effective than intracultural models. We identify and contribute behavioral features useful for intercultural affect recognition; in this study's context, facial features from the visual modality were more useful than features from the audio modality. Our paper presents a proof-of-concept and motivation for the future development of intercultural affect recognition systems, especially those deployed in low-resource settings without annotated data.
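To make the idea of attention-based feature selection concrete, the sketch below shows one minimal way attention weights over behavioral feature channels can be turned into a ranking of useful cues. This is an illustrative toy, not the paper's method: the feature names and the attention logits are hypothetical stand-ins for values a trained model would produce.

```python
import numpy as np

def attention_weights(logits):
    """Softmax over per-feature attention logits; returns one weight per
    behavioral cue, where a larger weight marks a more influential cue."""
    z = np.exp(logits - np.max(logits))  # subtract max for numerical stability
    return z / z.sum()

# Toy example: attention logits for four hypothetical behavioral cues
# [facial AU intensity, gaze direction, vocal pitch, vocal energy]
logits = np.array([2.0, 0.5, -1.0, 0.1])
weights = attention_weights(logits)
ranking = np.argsort(weights)[::-1]  # feature indices, most useful first
```

Under this sketch, feature channels with the highest attention weights would be the ones selected for training intercultural models.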