建立一个专家附加说明的巴西关于仇恨言论和攻击性语言探测的Instagram评论公司 (Building an Expert Annotated Corpus of Brazilian Instagram Comments for Hate Speech and Offensive Language Detection)

The understanding of an offense is subjective and people may have different opinions about the offensiveness of a comment. Moreover, offenses and hate speech may occur through sarcasm, which hides the real intention of the comment and makes the decision of the annotators more confusing. Therefore, providing a well-structured annotation process is crucial to a better understanding of hate speech and offensive language phenomena, as well as supplying better performance for machine learning classifiers. In this paper, we describe a corpus annotation process proposed by a linguist, a hate speech specialist, and machine learning engineers in order to support the identification of hate speech and offensive language on social media. In addition, we provide the first robust dataset of this kind for the Brazilian Portuguese language. The corpus was collected from Instagram posts of political personalities and manually annotated, being composed by 7,000 annotated documents according to three different layers: a binary classification (offensive versus non-offensive language), the level of offense (highly offensive, moderately offensive, and slightly offensive messages), and the identification regarding the target of the discriminatory content (xenophobia, racism, homophobia, sexism, religious intolerance, partyism, apology to the dictatorship, antisemitism, and fatphobia). Each comment was annotated by three different annotators and achieved high inter-annotator agreement. The proposed annotation approach is also language and domain-independent nevertheless it is currently customized for Brazilian Portuguese.

翻译：对犯罪的理解是主观的,人们可能对评论的冒犯性有不同的看法。此外,犯罪和仇恨言论可能通过讽刺来发生,这掩盖了评论的真实意图,使注解员的决定更加混乱。因此,提供一个结构完善的注解过程对于更好地理解仇恨言论和冒犯性语言现象至关重要,并且为机器学习分类者提供更好的表现。在本文中,我们描述了语言学家、仇恨言论专家和机器学习工程师提议的一个内容批注过程,以支持识别仇恨言论和社交媒体上的冒犯性语言。此外,我们为巴西葡萄牙语提供了第一个这类类型的有力数据组。这套材料是从Instagram政治人物和手动附加说明性文章中收集的,由7 000份附加说明性文件组成,分为三个不同层次:二进制分类(冒犯性与非冒犯性语言)、提议的冒犯程度(高度冒犯性、中度攻击性和轻微攻击性信息),以及针对歧视性内容的识别(仇恨性、种族主义、仇视同性恋、仇视性恐惧性、性仇视性仇视性、宗教、宗教不宽容性言论)也是通过一种高调和高调制实现的。