Crowdsourcing has emerged as a popular approach for collecting annotated data to train supervised machine learning models. However, annotator bias can lead to defective annotations. Though there are a few works investigating individual annotator bias, the group effects in annotators are largely overlooked. In this work, we reveal that annotators within the same demographic group tend to show consistent group bias in annotation tasks and thus we conduct an initial study on annotator group bias. We first empirically verify the existence of annotator group bias in various real-world crowdsourcing datasets. Then, we develop a novel probabilistic graphical framework GroupAnno to capture annotator group bias with a new extended Expectation Maximization (EM) training algorithm. We conduct experiments on both synthetic and real-world datasets. Experimental results demonstrate the effectiveness of our model in modeling annotator group bias in label aggregation and model learning over competitive baselines.
翻译:在收集附加说明的数据以培训受监督的机器学习模式方面,众包已成为一种受欢迎的方法,用于收集附加说明的数据,但说明偏差可能导致说明有缺陷。虽然在调查个别说明的偏差方面做了一些工作,但批注者对群体的影响基本上被忽视。在这项工作中,我们发现同一人口组内的批注者往往在批注任务方面表现出一贯的团体偏见,因此我们首先对批注者群体偏差进行了初步研究。我们首先从经验上核实了各种真实世界的众包数据集中是否存在批注者群体偏差。然后,我们开发了一个新的概率性图形框架组Anno,以捕捉批注者群体的偏差,采用新的扩展预期最大化培训算法。我们同时对合成和现实世界数据集进行实验。实验结果表明,我们的模型在标注组在标签汇总和模型学习方面对竞争性基线的偏差方面是有效的。