The prevalence of toxic content on social media platforms, such as hate speech, offensive language, and misogyny, presents serious challenges to our interconnected society. These challenging issues have attracted widespread attention in Natural Language Processing (NLP) community. In this paper, we present the submitted systems to the first Arabic Misogyny Identification shared task. We investigate three multi-task learning models as well as their single-task counterparts. In order to encode the input text, our models rely on the pre-trained MARBERT language model. The overall obtained results show that all our submitted models have achieved the best performances (top three ranked submissions) in both misogyny identification and categorization tasks.
翻译:社会媒体平台上有毒内容的流行,如仇恨言论、攻击性语言和厌恶女性等,给我们的相互联系的社会带来了严重挑战。这些具有挑战性的问题引起了自然语言处理社区的广泛关注。在本文中,我们向第一个阿拉伯Misogyny识别共同任务介绍所提交的系统。我们调查了三个多任务学习模式及其单任务对应任务。为了对输入文本进行编码,我们的模型依赖经过预先培训的MARBERT语言模式。总体结果显示,我们提交的所有模式在错误识别和分类任务方面都取得了最佳成绩(最多三个提交文件)。