The noise transition matrix plays a central role in the problem of learning with noisy labels. Among many other reasons, a large number of existing solutions rely on access to it. Identifying and estimating the transition matrix without ground truth labels is a critical and challenging task. When label noise transition depends on each instance, the problem of identifying the instance-dependent noise transition matrix becomes substantially more challenging. Despite recent works proposing solutions for learning from instance-dependent noisy labels, the field lacks a unified understanding of when such a problem remains identifiable. The goal of this paper is to characterize the identifiability of the label noise transition matrix. Building on Kruskal's identifiability results, we are able to show the necessity of multiple noisy labels in identifying the noise transition matrix for the generic case at the instance level. We further instantiate the results to explain the successes of the state-of-the-art solutions and how additional assumptions alleviated the requirement of multiple noisy labels. Our result also reveals that disentangled features are helpful in the above identification task and we provide empirical evidence.
翻译:噪声过渡矩阵在学习噪音标签问题上起着核心作用。除其他原因外,许多现有解决方案都依赖于使用它。识别和估算没有地面真相标签的过渡矩阵是一项关键而具有挑战性的任务。当标签噪音过渡取决于每个实例时,识别取决于实例的噪音过渡矩阵的问题变得更具挑战性。尽管最近的工作提出了从依赖实例的噪音标签学习的解决方案,但实地缺乏对何时仍能识别此类问题的统一理解。本文件的目的是说明标签噪音过渡矩阵的可识别性。以Kruskal的可识别性结果为基础,我们能够显示在为普通案例一级确定噪音过渡矩阵时必须使用多条噪音标签。我们进一步对结果进行速记,以解释最新解决方案的成功之处,并说明其他假设如何减轻了对多条噪音标签的要求。我们的结果还表明,分解的特征有助于上述识别任务,我们提供了经验证据。