Hybrid human-ML systems are increasingly in charge of consequential decisions in a wide range of domains. A growing body of work has advanced our understanding of these systems by providing empirical and theoretical analyses. However, existing empirical results are mixed, and theoretical proposals are often incompatible with each other. Our goal in this work is to bring much-needed organization to this field by offering a unifying framework for understanding conditions under which combining complementary strengths of human and ML leads to higher quality decisions than those produced by them individually -- a state to which we refer to as human-ML complementarity. We focus specifically on the context of human-ML predictive decision-making systems and investigate optimal ways of combining human and ML-based predictive decisions, accounting for the underlying causes of variation in their judgments. Within this scope, we present two crucial contributions. First, drawing upon prior literature in human psychology, machine learning, and human-computer interaction, we introduce a taxonomy characterizing a wide variety of criteria across which human and machine decision-making differ. Building on our taxonomy, our second contribution presents a unifying optimization-based framework for formalizing how human and ML predictive decisions should be aggregated optimally. We show that our proposed framework encompasses several existing models of human-ML complementarity as special cases. Last but not least, the exploratory analysis of our framework offers a critical piece of insight for future work in this area: the mechanism by which we combine human-ML judgments should be informed by the underlying causes of their diverging decisions.
翻译:越来越多的工作通过提供实证和理论分析,增进了我们对这些系统的了解。然而,现有的实证结果参差不齐,理论建议往往互不相干。我们这项工作的目标是提供一个统一的框架,以便了解人类和ML的互补优势导致作出比个别决定更高质量的决定的条件,我们称之为人与ML的互补,我们称之为人与ML的互补。我们特别侧重于人与ML的预测决策系统的背景,并研究将基于人和ML的预测决定结合起来的最佳方法,说明其判断差异的根本原因。在此范围内,我们提出两项关键的贡献。首先,我们借鉴人类心理学、机器学习和人-计算机互动方面的先前文献,引入一种分类学,将人类和机器决策各不相同的各种标准定性为不同的标准。我们从我们的分类学出发,我们的第二项贡献提出了一个统一的优化框架,以正式确定人与ML的预测性决定应当如何与基于人与ML的预测性决定相结合,同时说明其判断的深层次原因。我们在此范围内提出了两项关键的贡献。我们提出的框架是:在人类心理学、机器学习以及人与机器相互作用方面作出最后的判断。