Recent studies have alarmed that many online hate speeches are implicit. With its subtle nature, the explainability of the detection of such hateful speech has been a challenging problem. In this work, we examine whether ChatGPT can be used for providing natural language explanations (NLEs) for implicit hateful speech detection. We design our prompt to elicit concise ChatGPT-generated NLEs and conduct user studies to evaluate their qualities by comparison with human-written NLEs. We discuss the potential and limitations of ChatGPT in the context of implicit hateful speech research.
翻译:最近的研究震惊地发现,许多网上仇恨言论是隐含的。 由于其微妙的性质,发现这种仇恨言论的可解释性一直是一个具有挑战性的问题。 在这项工作中,我们研究查特格普特是否可以用来为隐含的仇恨言论检测提供自然语言解释(NLES ) 。 我们设计迅速的办法来获取简明的查特格普特生成的NLES,并进行用户研究,与人写的NLES相比,评估其品质。 我们讨论了查特格普特在隐含的仇恨言论研究中的潜力和局限性。</s>