The automatic identification of harmful content online is of major concern for social media platforms, policymakers, and society. Researchers have studied textual, visual, and audio content, but typically in isolation. Yet, harmful content often combines multiple modalities, as in the case of memes, which are of particular interest due to their viral nature. With this in mind, here we offer a comprehensive survey with a focus on harmful memes. Based on a systematic analysis of recent literature, we first propose a new typology of harmful memes, and then we highlight and summarize the relevant state of the art. One interesting finding is that many types of harmful memes are not really studied, e.g., such featuring self-harm and extremism, partly due to the lack of suitable datasets. We further find that existing datasets mostly capture multi-class scenarios, which are not inclusive of the affective spectrum that memes can represent. Another observation is that memes can propagate globally through repackaging in different languages and that they can also be multilingual, blending different cultures. We conclude by highlighting several challenges related to multimodal semiotics, technological constraints and non-trivial social engagement, and we present several open-ended aspects such as delineating online harm and empirically examining related frameworks and assistive interventions, which we believe will motivate and drive future research.
翻译:网上有害内容的自动识别是社会媒体平台、决策者和社会的主要关切。 研究人员研究了文字、视觉和音频内容,但通常是孤立的。 然而,有害内容往往结合多种模式,如因病毒性质而特别感兴趣的Memes模式。 牢记这一点,我们在此提供一份全面的调查,重点是有害的Memes。 根据对最近文献的系统分析,我们首先建议对有害的Memes进行新的类型分类,然后我们强调和总结相关的艺术状况。一个有趣的发现是,许多类型的有害Memes没有真正受到研究,例如,以自我伤害和极端主义为主,部分原因是缺乏合适的数据集。我们进一步发现,现有数据集大多包含多级情景,没有包含有害的Memes所代表的影响频谱。另一个观察是,Memes可以通过不同语言的重新组合在全球传播,它们也可以是多语言的,融合不同的文化。我们通过强调与多式半调制、技术制约和非三联调的迷取性迷取性迷惑症等相关的挑战,我们通过在线研究来推动一些开放的社会干预和动态框架。