Propaganda can be defined as a form of communication that aims to influence the opinions or the actions of people towards a specific goal; this is achieved by means of well-defined rhetorical and psychological devices. Propaganda, in the form we know it today, can be dated back to the beginning of the 17th century. However, it is with the advent of the Internet and the social media that it has started to spread on a much larger scale than before, thus becoming major societal and political issue. Nowadays, a large fraction of propaganda in social media is multimodal, mixing textual with visual content. With this in mind, here we propose a new multi-label multimodal task: detecting the type of propaganda techniques used in memes. We further create and release a new corpus of 950 memes, carefully annotated with 22 propaganda techniques, which can appear in the text, in the image, or in both. Our analysis of the corpus shows that understanding both modalities together is essential for detecting these techniques. This is further confirmed in our experiments with several state-of-the-art multimodal models.
翻译:宣传可以被定义为一种通信形式,其目的是影响人们的意见或行动,以实现具体目标;这是通过明确界定的口头和心理手段实现的;以我们今天所知的形式进行的宣传可以追溯到17世纪初;然而,随着互联网和社交媒体的出现,它开始在比以前大得多的范围内传播,从而成为重大的社会和政治问题。现在,社交媒体中的大部分宣传是多式的,将文字内容与视觉内容混杂在一起。我们在此提出一个新的多标签多式联运任务:发现在迷me中使用的宣传技术的类型。我们进一步创建和发布一套950美分的新材料,其中附有22种精心加注的宣传技术,这些技术可以在文字、图像或两者中出现。我们对各种材料的分析表明,理解这两种方式对于探测这些技术都是至关重要的。我们用几种最先进的多式联运模式进行的实验进一步证实了这一点。