Although virtue ethics has repeatedly been proposed as a suitable framework for the development of artificial moral agents (AMAs), it has been proven difficult to approach from a computational perspective. In this work, we present the first technical implementation of artificial virtuous agents (AVAs) in moral simulations. First, we review previous conceptual and technical work in artificial virtue ethics and describe a functionalistic path to AVAs based on dispositional virtues, bottom-up learning, and top-down eudaimonic reward. We then provide the details of a technical implementation in a moral simulation based on a tragedy of the commons scenario. The experimental results show how the AVAs learn to tackle cooperation problems while exhibiting core features of their theoretical counterpart, including moral character, dispositional virtues, learning from experience, and the pursuit of eudaimonia. Ultimately, we argue that virtue ethics provides a compelling path toward morally excellent machines and that our work provides an important starting point for such endeavors.
翻译:尽管美德道德一再被提议作为发展人造道德物剂的适当框架,但事实证明很难从计算的角度来看待。在这项工作中,我们在道德模拟中首次介绍了人造道德品剂的技术应用。首先,我们审查了以前在人造道德道德伦理方面的概念和技术工作,并描述了在处置美德、自下而上的学习和自上而下的经济奖赏基础上通往人造道德的实用道路。然后,我们根据共同事物的悲剧,在道德模拟中提供技术实施的细节。实验结果表明,人造道德品剂学会如何在展示其理论对应方的核心特征,包括道德性、品德、经验学习和追求尤迪莫尼亚等核心特征的同时,解决合作问题。最后,我们说,美德道德提供了一条通往道德上极好的机器的有力途径,我们的工作为这种努力提供了一个重要的起点。