This paper summarizes and evaluates various approaches, methods, and techniques for pursuing fairness in artificial intelligence (AI) systems. It examines the merits and shortcomings of these measures and proposes practical guidelines for defining, measuring, and preventing bias in AI. In particular, it cautions against some of the simplistic, yet common, methods for evaluating bias in AI systems, and offers more sophisticated and effective alternatives. The paper also addresses widespread controversies and confusions in the field by providing a common language among different stakeholders of high-impact AI systems. It describes various trade-offs involving AI fairness, and provides practical recommendations for balancing them. It offers techniques for evaluating the costs and benefits of fairness targets, and defines the role of human judgment in setting these targets. This paper provides discussions and guidelines for AI practitioners, organization leaders, and policymakers, as well as various links to additional materials for a more technical audience. Numerous real-world examples are provided to clarify the concepts, challenges, and recommendations from a practical perspective.
翻译:本文件总结并评价了在人工智能系统中追求公平的各种办法、方法和技术,审查了这些措施的优缺点,提出了界定、衡量和防止AI中偏见的实用准则,尤其告诫不要使用一些简单而常见的方法来评价AI系统中的偏见,并提出了更精密、更有效的替代办法;本文件还通过提供高影响力AI系统不同利益攸关方的共同语言,处理该领域的广泛争议和混乱问题;介绍了涉及AI公平的各种权衡,并为平衡这些权衡提出了切实可行的建议;为评估公平目标的成本和效益提供了技术,并界定了人类判断在确定这些目标方面的作用;本文件为AI从业人员、组织领导人和决策者提供了讨论和准则,以及为技术含量更高的受众提供了与补充材料的各种联系;提供了许多真实世界的实例,以便从实际角度澄清概念、挑战和建议。