Over the recent decades, there has been a significant increase and development of resources for Arabic natural language processing. This includes the task of exploring Arabic Language Sentiment Analysis (ALSA) from Arabic utterances in both Modern Standard Arabic (MSA) and different Arabic dialects. This study focuses on detecting sentiment in poems written in Misurata Arabic sub-dialect spoken in Misurata, Libya. The tools used to detect sentiment from the dataset are Sklearn as well as Mazajak sentiment tool 1. Logistic Regression, Random Forest, Naive Bayes (NB), and Support Vector Machines (SVM) classifiers are used with Sklearn, while the Convolutional Neural Network (CNN) is implemented with Mazajak. The results show that the traditional classifiers score a higher level of accuracy as compared to Mazajak which is built on an algorithm that includes deep learning techniques. More research is suggested to analyze Arabic sub-dialect poetry in order to investigate the aspects that contribute to sentiments in these multi-line texts; for example, the use of figurative language such as metaphors.
翻译:近几十年来,用于阿拉伯文自然语言处理的资源有了显著的增加和发展,其中包括从现代标准阿拉伯文和不同阿拉伯方言的阿拉伯语词句中探索阿拉伯语感应分析(ALSA)的任务。这项研究的重点是检测在利比亚米苏拉塔的米苏拉塔阿拉伯语次对流中写出的诗中的情绪。用于检测数据集情绪的工具是斯克莱恩和马扎哈克情绪工具1. 物流回归、随机森林、纳维贝斯(NB)和支助矢量机器分类器(SVM)与斯克莱恩一起使用,而进化神经网络(CNN)则与Mazajak一起实施。结果显示,传统分类器的准确度高于Mazajak,而Mazajak的算法是建立在包含深层学习技术的算法之上。建议进行更多的研究,分析阿拉伯语次对流量诗,以便调查这些多线文本中情感的方面;例如,使用比喻语言,例如比喻语言。