Robust learning methods aim to learn a clean target distribution from noisy and corrupted training data where a specific corruption pattern is often assumed a priori. Our proposed method can not only successfully learn the clean target distribution from a dirty dataset but also can estimate the underlying noise pattern. To this end, we leverage a mixture-of-experts model that can distinguish two different types of predictive uncertainty, aleatoric and epistemic uncertainty. We show that the ability to estimate the uncertainty plays a significant role in elucidating the corruption patterns as these two objectives are tightly intertwined. We also present a novel validation scheme for evaluating the performance of the corruption pattern estimation. Our proposed method is extensively assessed in terms of both robustness and corruption pattern estimation through a number of domains, including computer vision and natural language processing.
翻译:我们建议的方法不仅能够成功地从肮脏的数据集中学习干净的目标分布,而且可以估计潜在的噪音模式。为此,我们利用专家混合模型,可以区分两种不同的预测不确定性,即疏通性和认知性不确定性。我们表明,评估不确定性的能力在澄清腐败模式方面起着重要作用,因为这两个目标密切相关。我们还提出了一个新的验证计划,用以评价腐败模式估计的绩效。我们提议的方法通过若干领域,包括计算机视野和自然语言处理,从稳健性和腐败模式估测的角度进行了广泛评估。