Social media has seen a worrying rise in hate speech in recent times. Branching to several distinct categories of cyberbullying, gender discrimination, or racism, the combined label for such derogatory content can be classified as toxic content in general. This paper presents experimentation with a Keras wrapped lightweight BERT model to successfully identify hate speech and predict probabilistic impact score for the same to extract the hateful words within sentences. The dataset used for this task is the Hate Speech and Offensive Content Detection (HASOC 2021) data from FIRE 2021 in English. Our system obtained a validation accuracy of 82.60%, with a maximum F1-Score of 82.68%. Subsequently, our predictive cases performed significantly well in generating impact scores for successful identification of the hate tweets as well as the hateful words from tweet pools.
翻译:最近,社交媒体的仇恨言论出现了令人担忧的上升。在网络欺凌、性别歧视或种族主义等几类不同类型的网络欺凌、性别歧视或种族主义中,这类贬损内容的合并标签可被归为一般有毒内容。本文展示了Keras包装轻重量BERT模型的实验,以成功识别仇恨言论,并预测同一种语言的概率影响得分,从而在句中提取仇恨词。这项任务使用的数据集是来自FIRE 2021英文版的仇恨言词和攻击性内容探测(HASOC 2021)数据。我们的系统获得了82.60%的验证准确性,最高F1-STRO为82.68%。随后,我们的预测案例在成功识别仇恨推特以及推特库中的恶言方面产生了显著的影响分数。