In this paper, we argue that the way we have been training and evaluating ML models has largely forgotten the fact that they are applied in an organization or societal context as they provide value to people. We show that with this perspective we fundamentally change how we evaluate, select and deploy ML models - and to some extent even what it means to learn. Specifically, we stress that the notion of value plays a central role in learning and evaluating, and different models may require different learning practices and provide different values based on the application context they are applied. We also show that this concretely impacts how we select and embed models into human workflows based on experimental datasets. Nothing of what is presented here is hard: to a large extent is a series of fairly trivial observations with massive practical implications.
翻译:在本文中,我们争论说,我们一直在培训和评价多边贷款模式的方式在很大程度上忘记了这样一个事实,即这些模式应用于一个组织或社会环境,因为它们为人们提供了价值。我们从这个角度表明,我们从根本上改变了我们如何评价、选择和部署多边贷款模式的方式,在某种程度上甚至意味着什么。具体地说,我们强调,价值概念在学习和评价中发挥着核心作用,不同的模式可能需要不同的学习做法,并根据应用环境提供不同的价值。我们还表明,这种具体影响如何选择模型并将其嵌入基于实验数据集的人类工作流程。这里所介绍的没有什么是困难的:在很大程度上是一系列具有大规模实际影响的相当微不足道的观察。