As machine learning (ML) increasingly affects people and society, awareness of its potential unwanted consequences has also grown. To anticipate, prevent, and mitigate undesirable downstream consequences, it is critical that we understand when and how harm might be introduced throughout the ML life cycle. In this paper, we provide a framework that identifies seven distinct potential sources of downstream harm in machine learning, spanning data collection, development, and deployment. In doing so, we aim to facilitate more productive and precise communication around these issues, as well as more direct, application-grounded ways to mitigate them.
翻译:随着机器学习(ML)日益影响人民和社会,人们也日益认识到机器学习的潜在不想要的后果,为了预测、预防和减轻不可取的下游后果,我们必须了解何时以及如何在整个机器学习生命周期中造成伤害,在本文件中,我们提供了一个框架,确定在机器学习、数据收集、开发和部署方面造成下游伤害的7个明显的潜在来源。我们这样做的目的是促进围绕这些问题进行更有成效和准确的沟通,以及采取更直接和基于应用的方法来减轻这些危害。