Machine learning (ML) models are increasingly being used in application domains that often involve working together with human experts. In this context, it can be advantageous to defer certain instances to a single human expert when they are difficult to predict for the ML model. While previous work has focused on scenarios with one distinct human expert, in many real-world situations several human experts with varying capabilities may be available. In this work, we propose an approach that trains a classification model to complement the capabilities of multiple human experts. By jointly training the classifier together with an allocation system, the classifier learns to accurately predict those instances that are difficult for the human experts, while the allocation system learns to pass each instance to the most suitable team member -- either the classifier or one of the human experts. We evaluate our proposed approach in multiple experiments on public datasets with "synthetic" experts and a real-world medical dataset annotated by multiple radiologists. Our approach outperforms prior work and is more accurate than the best human expert or a classifier. Furthermore, it is flexibly adaptable to teams of varying sizes and different levels of expert diversity.
翻译:机械学习(ML)模型正越来越多地用于往往涉及与人类专家合作的应用领域。在这方面,在难以预测ML模型时,将某些实例推迟到某个单一的人类专家处理可能是有益的。虽然以前的工作侧重于与一位独特的人类专家的情景,但在许多现实世界中,可能有几位能力不同的人类专家。在这项工作中,我们提出一种方法,培训一种分类模型,以补充多个人类专家的能力。通过联合培训分类员和分配系统,分类员学会准确预测那些对人类专家来说困难的情况,而分配系统则学会将每个实例传递给最合适的小组成员 -- -- 分类员或一名人类专家。我们评估了在公共数据集方面与“合成”专家和由多个放射学家附加说明的真实世界医疗数据集的多重实验中的拟议做法。我们的方法超越了先前的工作,比最佳人类专家或分类员更准确。此外,我们的方法灵活地适应了不同规模和不同程度专家的团队。