Images can convey rich semantics and induce various emotions in viewers. Recently, with the rapid advancement of emotional intelligence and the explosive growth of visual data, extensive research efforts have been dedicated to affective image content analysis (AICA). In this survey, we will comprehensively review the development of AICA in the recent two decades, especially focusing on the state-of-the-art methods with respect to three main challenges -- the affective gap, perception subjectivity, and label noise and absence. We begin with an introduction to the key emotion representation models that have been widely employed in AICA and description of available datasets for performing evaluation with quantitative comparison of label noise and dataset bias. We then summarize and compare the representative approaches on (1) emotion feature extraction, including both handcrafted and deep features, (2) learning methods on dominant emotion recognition, personalized emotion prediction, emotion distribution learning, and learning from noisy data or few labels, and (3) AICA based applications. Finally, we discuss some challenges and promising research directions in the future, such as image content and context understanding, group emotion clustering, and viewer-image interaction.
翻译:最近,随着情感智能的迅速发展以及视觉数据的爆炸性增长,我们专门开展了广泛的研究工作,以进行感知图像内容分析(AICA),在这次调查中,我们将全面审查近二十年来AICA的发展动态,特别是围绕三大挑战,即影响差距、感知主观性、标签噪音和不存在等最先进的方法。我们首先介绍AICA广泛使用的关键情感代表模式,并介绍现有评价数据集与标签噪音和数据集偏差的定量比较。然后我们总结和比较有代表性的方法:(1) 情感特征提取,包括手工艺和深层特征;(2) 主导情感识别、个性化情感预测、情感分布学习和从噪音数据或少数标签中学习的学习方法;(3) ACA应用程序。最后,我们讨论一些挑战和未来有希望的研究方向,例如图像内容和背景理解、群体情感组合和视觉互动。