Memes are the new-age conveyance mechanism for humor on social media sites. Memes often include an image and some text. Memes can be used to promote disinformation or hatred, thus it is crucial to investigate in details. We introduce Memotion 3, a new dataset with 10,000 annotated memes. Unlike other prevalent datasets in the domain, including prior iterations of Memotion, Memotion 3 introduces Hindi-English Codemixed memes while prior works in the area were limited to only the English memes. We describe the Memotion task, the data collection and the dataset creation methodologies. We also provide a baseline for the task. The baseline code and dataset will be made available at https://github.com/Shreyashm16/Memotion-3.0
翻译:模因是新时代社交媒体上幽默传播机制。模因通常包括一张图片和一些文字。模因可以用来传播错误信息或令人憎恶的东西,因此有必要进行详细的研究。我们引入了Memotion 3,这是一个新的数据集,包含了10000个已注释的模因。与该领域中其他普遍存在的数据集不同,包括Memotion的先前版本,Memotion 3引入了印度英语混合模因,而之前的工作仅限于英语模因。我们描述了Memotion任务,数据收集和数据集创建方法。我们还提供了该任务的基线。代码和数据集基线将在https://github.com/Shreyashm16/Memotion-3.0 上公开。