Norm discovery is important for understanding and reasoning about the acceptable behaviors and potential violations in human communication and interactions. We introduce NormSage, a framework for addressing the novel task of conversation-grounded multi-lingual, multi-cultural norm discovery, based on language model prompting and self-verification. NormSAGE leverages the expressiveness and implicit knowledge of the pretrained GPT-3 language model backbone, to elicit knowledge about norms through directed questions representing the norm discovery task and conversation context. It further addresses the risk of language model hallucination with a self-verification mechanism ensuring that the norms discovered are correct and are substantially grounded to their source conversations. Evaluation results show that our approach discovers significantly more relevant and insightful norms for conversations on-the-fly compared to baselines (>10+% in Likert scale rating). The norms discovered from Chinese conversation are also comparable to the norms discovered from English conversation in terms of insightfulness and correctness (<3% difference). In addition, the culture-specific norms are promising quality, allowing for 80% accuracy in culture pair human identification. Finally, our grounding process in norm discovery self-verification can be extended for instantiating the adherence and violation of any norm for a given conversation on-the-fly, with explainability and transparency. NormSAGE achieves an AUC of 95.4% in grounding, with natural language explanation matching human-written quality.
翻译:对理解和推理人类交流和互动中的可接受行为和潜在侵犯行为而言,诺姆发现非常重要。我们引入了诺姆Sage(NormSage),这是应对基于对话的多语言、多文化规范发现这一新任务的框架,以语言模式促进和自我验证为基础。诺姆SAGE利用了经过预先训练的GPT-3语言模型主干体的清晰和隐含知识,通过代表常规发现任务和对话背景的定向问题,获得关于规范的知识。它进一步用自我核实机制解决语言模式幻觉的风险,确保所发现的准则正确,并在很大程度上根植于其源性对话。评价结果表明,我们的方法发现,与基线相比,在空中对话中发现的相关性和深刻得多(>10 ⁇ 在Irrirt等级评级中)。 从中国对话中发现的规范也与英语对话中发现的清晰和正确性规范( < 3%的差异) 相仿。此外,具体文化规范的质量是很有希望的,允许在文化识别方面达到80%的准确性。最后,我们在标准发现自我核实过程中发现自我核实的规范过程可以扩展为可追溯性解释的准确性。95度,对原则进行精确性解释。