We present an enrichment of the Hateval corpus of hate speech tweets (Basile et. al 2019) aimed to facilitate automated counter-narrative generation. Comparably to previous work (Chung et. al. 2019), manually written counter-narratives are associated to tweets. However, this information alone seems insufficient to obtain satisfactory language models for counter-narrative generation. That is why we have also annotated tweets with argumentative information based on Wagemanns (2016), that we believe can help in building convincing and effective counter-narratives for hate speech against particular groups. We discuss adequacies and difficulties of this annotation process and present several baselines for automatic detection of the annotated elements. Preliminary results show that automatic annotators perform close to human annotators to detect some aspects of argumentation, while others only reach low or moderate level of inter-annotator agreement.
翻译:与先前的工作(Chung等人,2019年)相比,人工撰写的反言论与推文相关,然而,这一信息本身似乎不足以为反言论生成获得令人满意的语言模式,因此,我们还有一份附有基于Wagemanns(2016年)的论证信息的附加说明的推文,我们认为这有助于为针对特定群体的仇恨言论建立令人信服的、有效的反言论。我们讨论了这一批注过程的恰当性和困难,并提出了自动检测附加说明要素的若干基线。初步结果显示,自动批注者在接近人类的批注者时,发现一些争论的方面,而另一些则只达到低度或中度的批发协议。