If autonomous AI systems are to be reliably safe in novel situations, they will need to incorporate general principles guiding them to recognize and avoid harmful behaviours. Such principles may need to be supported by a binding system of regulation, which would need the underlying principles to be widely accepted. They should also be specific enough for technical implementation. Drawing inspiration from law, this article explains how negative human rights could fulfil the role of such principles and serve as a foundation both for an international regulatory system and for building technical safety constraints for future AI systems.
翻译:如果自主人工智能系统在新情况下要可靠地安全,它们将需要融合引导其识别和避免有害行为的一般性原则。这些原则可能需要得到一套约束性的监管体系的支持,该监管体系需要被广泛接受的基础原则支撑,并且应该足够具体以便实现技术。本文从法律方面汲取启示,解释了如何通过负人权来实现这些原则的作用,既可以为国际监管体系奠定基础,也可以为未来人工智能系统建立技术安全约束。