Memes are the new-age conveyance mechanism for humor on social media sites. Memes often include an image and some text. Memes can be used to promote disinformation or hatred, thus it is crucial to investigate in details. We introduce Memotion 3, a new dataset with 10,000 annotated memes. Unlike other prevalent datasets in the domain, including prior iterations of Memotion, Memotion 3 introduces Hindi-English Codemixed memes while prior works in the area were limited to only the English memes. We describe the Memotion task, the data collection and the dataset creation methodologies. We also provide a baseline for the task. The baseline code and dataset will be made available at https://github.com/Shreyashm16/Memotion-3.0
翻译:表情包是在社交媒体上传递幽默的新时代传递机制。表情包常常包含一个图像和一些文本。表情包可以用来宣传不实信息或仇恨,因此有必要进行详细调查。我们介绍了 Memotion 3,一个新的数据集,具有 10,000 个带注释的表情包。与该领域中其他普遍存在的数据集(包括之前的 Memotion 数据集)不同,Memotion 3 引入了印地 - 英文混合表情包,而先前的工作仅局限于英语表情包。我们描述了 Memotion 任务,数据收集和数据集创建方法。我们还为任务提供了基准。基准代码和数据集可在https://github.com/Shreyashm16/Memotion-3.0上获取。