Context is vital for commonsense moral reasoning. "Lying to a friend" is wrong if it is meant to deceive them, but may be morally okay if it is intended to protect them. Such nuanced but salient contextual information can potentially flip the moral judgment of an action. Thus, we present ClarifyDelphi, an interactive system that elicits missing contexts of a moral situation by generating clarification questions such as "Why did you lie to your friend?". Our approach is inspired by the observation that questions whose potential answers lead to diverging moral judgments are the most informative. We learn to generate questions using Reinforcement Learning, by maximizing the divergence between moral judgements of hypothetical answers to a question. Human evaluation shows that our system generates more relevant, informative and defeasible questions compared to other question generation baselines. ClarifyDelphi assists informed moral reasoning processes by seeking additional morally consequential context to disambiguate social and moral situations.
翻译:对于常识道德推理来说,背景是十分重要的。 “ 欺骗朋友”是错的,如果是为了欺骗他们,但如果是为了保护他们,则可能在道德上是错的。 这种细微但突出的背景资料有可能推翻行动的道德判断。 因此,我们提出了Clarify Delphi, 这是一种互动系统,它通过提出“你为什么要欺骗朋友”这样的澄清问题,来导致道德状况的缺失。我们的方法受到以下观点的启发,即潜在答案导致道德判断不一致的问题最为丰富。我们学会通过强化学习产生问题,通过最大限度地扩大对问题的假设答案的道德判断之间的分歧。人类评估表明,我们的系统与其他问题代基线相比,产生了更相关、更丰富、更不可行的问题。Clarify Delphi通过寻求额外的道德相关背景来解析社会和道德状况,从而协助知情的道德推理过程。