在人群序列标签中的真相发现 (Truth Discovery in Sequence Labels from Crowds)

Annotations quality and quantity positively affect the performance of sequence labeling, a vital task in Natural Language Processing. Hiring domain experts to annotate a corpus set is very costly in terms of money and time. Crowdsourcing platforms, such as Amazon Mechanical Turk (AMT), have been deployed to assist in this purpose. However, these platforms are prone to human errors due to the lack of expertise; hence, one worker's annotations cannot be directly used to train the model. Existing literature in annotation aggregation more focuses on binary or multi-choice problems. In recent years, handling the sequential label aggregation tasks on imbalanced datasets with complex dependencies between tokens has been challenging. To conquer the challenge, we propose an optimization-based method that infers the best set of aggregated annotations using labels provided by workers. The proposed Aggregation method for Sequential Labels from Crowds ($AggSLC$) jointly considers the characteristics of sequential labeling tasks, workers' reliabilities, and advanced machine learning techniques. We evaluate $AggSLC$ on different crowdsourced data for Named Entity Recognition (NER), Information Extraction tasks in biomedical (PICO), and the simulated dataset. Our results show that the proposed method outperforms the state-of-the-art aggregation methods. To achieve insights into the framework, we study $AggSLC$ components' effectiveness through ablation studies by evaluating our model in the absence of the prediction module and inconsistency loss function. Theoretical analysis of our algorithm's convergence points that the proposed $AggSLC$ halts after a finite number of iterations.

翻译：说明质量和数量对序列标签的性能产生了积极影响,这是自然语言处理中一项至关重要的任务。雇用域专家对一组数据进行批注在资金和时间上都非常昂贵。已经为此部署了亚马逊机械土耳其(AMT)等众包平台。但是,由于缺乏专业知识,这些平台容易出现人为错误; 因此, 无法直接使用一位工人的注释来培训模型。说明中的现有文献更多地侧重于二进制或多选制问题。近年来, 处理关于不同数据群集的顺序标签汇总任务,在各种符号之间复杂的依赖性之间,费用非常昂贵。为了克服挑战,我们建议了一种基于优化的方法,利用工人提供的标签来推断最佳的一组汇总说明。提议的对来自C的序列标签的分类方法(AggSLC$ ) 联合考虑顺序标签任务的特性、工人的稳定性, 以及先进的机器学习技术。我们评估了用于NASLA的组别数据组合组合组合组合组合组合, 并用ANERSLA的数值分析方法, 以模拟方式完成我们企业的模型分析结果。