Humans exhibit disagreement during data labeling. We term this disagreement as human label uncertainty. In this work, we study the ramifications of human label uncertainty (HLU). Our evaluation of existing uncertainty estimation algorithms, with the presence of HLU, indicates the limitations of existing uncertainty metrics and algorithms themselves in response to HLU. Meanwhile, we observe undue effects in predictive uncertainty and generalizability. To mitigate the undue effects, we introduce a novel natural scene statistics (NSS) based label dilution training scheme without requiring massive human labels. Specifically, we first select a subset of samples with low perceptual quality ranked by statistical regularities of images. We then assign separate labels to each sample in this subset to obtain a training set with diluted labels. Our experiments and analysis demonstrate that training with NSS-based label dilution alleviates the undue effects caused by HLU.
翻译:人类在数据标签中表现出分歧。 我们称这种分歧为人类标签不确定性。 在这项工作中,我们研究了人类标签不确定性(HLU)的影响。我们对现有不确定性估算算法的评估,加上HLU的存在,表明现有不确定性衡量法和算法本身在应对HLU方面的局限性。 同时,我们观察到预测不确定性和可概括性方面的不当影响。为了减轻不当影响,我们引入了新的自然场景统计(NSS)基于标签稀释培训计划,而不需要大量的人类标签。具体地说,我们首先选择了按图像统计规律排列的低认知质量样本。我们随后为这一组样本中的每个样本指定了单独的标签,以获得带有稀释标签的培训。我们的实验和分析表明,使用以NSS为基础的标签稀释培训减轻了HLU造成的不当影响。