Quality control is an ongoing concern in citizen science that is often managed by replication to consensus in online tasks such as image classification. Numerous factors can lead to disagreement, including image quality problems, interface specifics, and the complexity of the content itself. We conducted trace ethnography with statistical and qualitative analyses of six Snapshot Safari projects to understand the content characteristics that can lead to uncertainty and low consensus. This study contributes content categorization based on aggregate classifications to characterize image complexity, with analysis that confirms that the categories impact classification efficiency, and an inductively generated set of additional image quality issues that also impact volunteers' ability to confidently classify content. The results suggest that different conceptualizations and measures of consensus may be needed for different types of content, and aggregate responses offer a way to identify content that needs different handling when complexity cannot be determined $a$ $priori$.
翻译:质量控制是公民科学的一个长期问题,经常通过复制来管理公民科学,在图像分类等在线任务中达成共识。许多因素可能导致分歧,包括图像质量问题、界面特征和内容本身的复杂性。我们对6个“快照”Safari项目进行了统计和定性分析,对6个“快照”Safari项目进行了跟踪人种学,以了解可能导致不确定性和低共识的内容特征。这项研究根据综合分类对内容进行了分类,以描述图像复杂性的特点。该研究通过分析确认类别影响分类效率,并隐含产生了一系列额外的图像质量问题,这也影响到志愿人员对内容进行自信分类的能力。结果表明,不同类型内容可能需要不同的概念化和共识措施,综合回应提供了在无法确定复杂程度时确定需要不同处理的内容的方法。