The role of predicting sarcasm in the text is known as automatic sarcasm detection. Given the prevalence and challenges of sarcasm in sentiment-bearing text, this is a critical phase in most sentiment analysis tasks. With the increasing popularity and usage of different social media platforms among users around the world, people are using sarcasm more and more in their day-to-day conversations, social media posts and tweets, and it is considered as a way for people to express their sentiment about some certain topics or issues. As a result of the increasing popularity, researchers started to focus their research endeavors on detecting sarcasm from a text in different languages especially the English language. However, the task of sarcasm detection is a challenging task due to the nature of sarcastic texts; which can be relative and significantly differs from one person to another depending on the topic, region, the user's mentality and other factors. In addition to these challenges, sarcasm detection in the Arabic language has its own challenges due to the complexity of the Arabic language, such as being morphologically rich, with many dialects that significantly vary between each other, while also being lowly resourced. In recent years, only few research attempts started tackling the task of sarcasm detection in Arabic, including creating and collecting corpora, organizing workshops and establishing baseline models. This paper intends to create a new humanly annotated Arabic corpus for sarcasm detection collected from tweets, and implementing a new approach for sarcasm detection and quantification in Arabic tweets. The annotation technique followed in this paper is unique in sarcasm detection and the proposed approach tackles the problem as a regression problem instead of classification; i.e., the model attempts to predict the level of sarcasm instead of binary classification.
翻译:预测文本中的讽刺内容的作用被称为自动讽刺性检测。 鉴于情绪化文本中的讽刺性讽刺性讽刺性言论的流行程度和挑战,这是大多数情绪分析任务的关键阶段。随着世界各地用户对不同社交媒体平台的日益普及和使用,人们在日常对话、社交媒体文章和推特中越来越多地使用讽刺性言论,并被视为人们表达对某些议题或问题的情绪的一种方式。由于受欢迎程度的提高,研究人员开始将其研究的重点放在从不同语言特别是英语的文本中发现讽刺性的讽刺性言论上。然而,讽刺性探测任务是一项具有挑战性的任务,因为讽刺性言论的文本的性质日益受到全世界的用户的欢迎和使用,在日常对话、社交媒体文章和推特中,人们越来越多地使用讽刺性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性言论性