Hate speech has grown significantly on social media, causing serious consequences for victims of all demographics. Despite much attention being paid to characterize and detect discriminatory speech, most work has focused on explicit or overt hate speech, failing to address a more pervasive form based on coded or indirect language. To fill this gap, this work introduces a theoretically-justified taxonomy of implicit hate speech and a benchmark corpus with fine-grained labels for each message and its implication. We present systematic analyses of our dataset using contemporary baselines to detect and explain implicit hate speech, and we discuss key features that challenge existing models. This dataset will continue to serve as a useful benchmark for understanding this multifaceted issue.
翻译:仇恨言论在社交媒体上明显增加,给所有人口统计的受害者造成了严重后果。尽管人们非常关注歧视性言论的特征特征和检测,但大多数工作的重点是明确或公开的仇恨言论,未能解决基于编码或间接语言的更为普遍的形式问题。为填补这一空白,这项工作引入了一种理论上合理的隐含仇恨言论分类法,并引入了对每条信息及其含义贴有细微标签的基准程序。我们利用当代基线对数据集进行系统分析,以发现和解释隐含的仇恨言论,我们讨论了挑战现有模式的主要特征。这一数据集将继续作为了解这一多层面问题的有用基准。