The concerning rise of hateful content on online platforms has increased the attention towards automatic hate speech detection, commonly formulated as a supervised classification task. State-of-the-art deep learning-based approaches usually require a substantial amount of labeled resources for training. However, annotating hate speech resources is expensive, time-consuming, and often harmful to the annotators. This creates a pressing need to transfer knowledge from the existing labeled resources to low-resource hate speech corpora with the goal of improving system performance. For this, neighborhood-based frameworks have been shown to be effective. However, they have limited flexibility. In our paper, we propose a novel training strategy that allows flexible modeling of the relative proximity of neighbors retrieved from a resource-rich corpus to learn the amount of transfer. In particular, we incorporate neighborhood information with Optimal Transport, which permits exploiting the geometry of the data embedding space. By aligning the joint embedding and label distributions of neighbors, we demonstrate substantial improvements over strong baselines, in low-resource scenarios, on different publicly available hate speech corpora.
翻译:网上平台上仇恨内容的上升增加了对自动识别仇恨言论的关注,通常作为一种监督的分类任务。最先进的深层次学习方法通常需要大量贴标签的培训资源。然而,批注仇恨言论资源昂贵、耗时且往往有害于告示员。这造成迫切需要将知识从现有的标签资源转移到低资源仇恨言论内容公司,以改善系统性能为目标。为此,以街区为基础的框架已证明是有效的,但灵活性有限。在我们的论文中,我们提出了一个新颖的培训战略,允许对从资源丰富的设施中回收的邻居相对相邻的距离进行灵活建模,以了解转移的数量。特别是,我们将邻里信息与最佳运输公司合并,允许利用数据嵌入空间的几何方法。我们通过将邻居的联合嵌入和标签分布与改善系统性能的目标相匹配,我们展示了在各种公开的仇恨言论公司中,在低资源假设的强基线上的巨大改进。