Curbing online hate speech has become the need of the hour; however, a blanket ban on such activities is infeasible for several geopolitical and cultural reasons. To reduce the severity of the problem, in this paper, we introduce a novel task, hate speech normalization, that aims to weaken the intensity of hatred exhibited by an online post. The intention of hate speech normalization is not to support hate but instead to provide the users with a stepping stone towards non-hate while giving online platforms more time to monitor any improvement in the user's behavior. To this end, we manually curated a parallel corpus - hate texts and their normalized counterparts (a normalized text is less hateful and more benign). We introduce NACL, a simple yet efficient hate speech normalization model that operates in three stages - first, it measures the hate intensity of the original sample; second, it identifies the hate span(s) within it; and finally, it reduces hate intensity by paraphrasing the hate spans. We perform extensive experiments to measure the efficacy of NACL via three-way evaluation (intrinsic, extrinsic, and human-study). We observe that NACL outperforms six baselines - NACL yields a score of 0.1365 RMSE for the intensity prediction, 0.622 F1-score in the span identification, and 82.27 BLEU and 80.05 perplexity for the normalized text generation. We further show the generalizability of NACL across other platforms (Reddit, Facebook, Gab). An interactive prototype of NACL was put together for the user study. Further, the tool is being deployed in a real-world setting at Wipro AI as a part of its mission to tackle harmful content on online platforms.
翻译:消除网络仇恨言论已成为当务之急;然而,出于若干地缘政治和文化原因,全面禁止此类活动是行不通的。为了降低问题的严重性,我们在本文件中引入了新颖且高效的仇恨言论正常化,目的是削弱网络文章所展示的仇恨情绪。仇恨言论正常化的意图不是支持仇恨,而是为用户提供一个走向非仇恨的垫脚石,同时给在线平台更多时间来监测用户行为的任何改善。为此,我们手工整理了一个平行文件――仇恨文本及其正常化对应方(普通文本不那么令人憎恶,更温和 ) 。我们引入了NACL,这是一个简单而高效的仇恨言论正常化模式,在三个阶段运行 — — 首先,它测量原始样本的仇恨情绪强度;第二,它确定了其中的仇恨范围;最后,它通过对仇恨范围进行翻转,同时让在线平台有更多时间来测量NACLL的功效。我们用三线评估(Interminical、Explorical和人文研究),我们观察到,NACLFCLF的精确度将AS的六度比值比值提高到了在线。