While much attention has been paid to identifying explicit hate speech, implicit hateful expressions that are disguised in coded or indirect language are pervasive and remain a major challenge for existing hate speech detection systems. This paper presents the first attempt to apply Entity Linking (EL) techniques to both explicit and implicit hate speech detection, where we show that such real world knowledge about entity mentions in a text does help models better detect hate speech, and the benefit of adding it into the model is more pronounced when explicit entity triggers (e.g., rally, KKK) are present. We also discuss cases where real world knowledge does not add value to hate speech detection, which provides more insights into understanding and modeling the subtleties of hate speech.
翻译:虽然人们非常注意查明明确的仇恨言论,但以编码或间接语言伪装的隐含仇恨言论十分普遍,仍然是现有仇恨言论检测系统的一大挑战,本文首次尝试将实体联系技术用于明确和隐含的仇恨言论检测,我们在这里表明,在文本中提及的实体的真实世界知识有助于模型更好地检测仇恨言论,如果有明确的实体触发仇恨言论(如集会、KKK),将仇恨言论添加到模型中的好处就更加明显。我们还讨论现实世界知识不增加仇恨言论检测价值的案例,这为理解和模拟仇恨言论的微妙之处提供了更多见解。