Politically sensitive topics remain a challenge for open-domain chatbots, yet handling such content in a responsible, non-partisan, and safe manner is essential for them. Currently, the main approach to political sensitivity is simply to change the topic when it is detected. This is safe but evasive, and it results in a less engaging chatbot. In this work, as a first step towards a politically safe chatbot, we propose a group of metrics for assessing a chatbot's political prudence. We then analyze the political prudence of various chatbots and discuss their behavior from multiple angles through both automatic and human evaluation metrics. The test sets and codebase are released to promote research in this area.