This chapter formulates seven lessons for preventing harm in artificial intelligence (AI) systems based on insights from the field of system safety for software-based automation in safety-critical domains. New applications of AI across societal domains and public organizations and infrastructures come with new hazards, which lead to new forms of harm, both grave and pernicious. The text addresses the lack of consensus for diagnosing and eliminating new AI system hazards. For decades, the field of system safety has dealt with accidents and harm in safety-critical systems governed by varying degrees of software-based automation and decision-making. This field embraces the core assumption of systems and control that AI systems cannot be safeguarded by technical design choices on the model or algorithm alone, instead requiring an end-to-end hazard analysis and design frame that includes the context of use, impacted stakeholders and the formal and informal institutional environment in which the system operates. Safety and other values are then inherently socio-technical and emergent system properties that require design and control measures to instantiate these across the technical, social and institutional components of a system. This chapter honors system safety pioneer Nancy Leveson, by situating her core lessons for today's AI system safety challenges. For every lesson, concrete tools are offered for rethinking and reorganizing the safety management of AI systems, both in design and governance. This history tells us that effective AI safety management requires transdisciplinary approaches and a shared language that allows involvement of all levels of society.
翻译:本章根据安全关键领域的软件自动化系统安全领域的洞察力,总结了防止人工智能系统伤害的七种经验教训; 在安全关键领域的软件自动化系统安全领域,从系统安全角度,提出了防止人工智能系统伤害的七种经验教训; 在社会领域和公共组织及基础设施中,AI的新应用带来了新的危害,导致出现新的严重和有害形式的危害; 案文讨论了在诊断和消除新的人工智能系统危害方面缺乏共识的问题; 几十年来,系统安全领域涉及由不同程度的基于软件的自动化和决策所制约的安全关键系统中的事故和伤害; 这一领域包含了系统和控制的核心假设,即光靠模型或算法的技术设计选择无法保障AI系统的安全,而需要从终端到终端的危险分析和设计框架,其中包括使用环境、受影响的利益攸关方以及系统运作的正式和非正式体制环境; 安全和其他价值是内在的社会技术和新兴系统特性,需要设计和控制措施,以便在系统的技术、社会、社会和机构组成部分中对这些特性进行回馈。 本章是安全系统的先驱Nancy Leveson, 其核心设计选择不能仅靠模型或算算法来维护其安全,而是在当今安全管理体系中提供安全管理方面的最佳做法。