Hate speech detection models are typically evaluated on held-out test sets. However, this risks painting an incomplete and potentially misleading picture of model performance because of increasingly well-documented systematic gaps and biases in hate speech datasets. To enable more targeted diagnostic insights, recent research has thus introduced functional tests for hate speech detection models. However, these tests currently only exist for English-language content, which means that they cannot support the development of more effective models in other languages spoken by billions across the world. To help address this issue, we introduce Multilingual HateCheck (MHC), a suite of functional tests for multilingual hate speech detection models. MHC covers 34 functionalities across ten languages, which is more languages than any other hate speech dataset. To illustrate MHC's utility, we train and test a high-performing multilingual hate speech detection model, and reveal critical model weaknesses for monolingual and cross-lingual applications.