Quite some real-world problems can be formulated as decision-making problems wherein one must repeatedly make an appropriate choice from a set of alternatives. Multiple expert judgements, whether human or artificial, can help in taking correct decisions, especially when exploration of alternative solutions is costly. As expert opinions might deviate, the problem of finding the right alternative can be approached as a collective decision making problem (CDM) via aggregation of independent judgements. Current state-of-the-art approaches focus on efficiently finding the optimal expert, and thus perform poorly if all experts are not qualified or if they are overly biased, thereby potentially derailing the decision-making process. In this paper, we propose a new algorithmic approach based on contextual multi-armed bandit problems (CMAB) to identify and counteract such biased expertise. We explore homogeneous, heterogeneous and polarised expert groups and show that this approach is able to effectively exploit the collective expertise, outperforming state-of-the-art methods, especially when the quality of the provided expertise degrades. Our novel CMAB-inspired approach achieves a higher final performance and does so while converging more rapidly than previous adaptive algorithms.
翻译:许多专家判断,无论是人为还是人为的专家判断,都有助于做出正确的决定,特别是在探索替代解决办法代价高昂的情况下。专家意见可能会有偏差,因此,找到正确的替代办法的问题可以作为一个集体决策问题来处理。目前最先进的方法侧重于高效率地找到最佳专家,因此,如果所有专家不合格或过于偏颇,则表现不佳,从而有可能破坏决策进程。在本文件中,我们提出了一种基于背景性多臂土匪问题的新算法方法,以查明和抵制这种偏颇的专门知识。我们探索了同质、多元和对立的专家组,并表明这一方法能够有效地利用集体专门知识,表现优于最先进的方法,特别是在所提供的专门知识质量下降时。我们新的CMAB启发性方法取得了更高的最终业绩,而且比以前的适应性算法更快。