在不平衡的数据集上对标签噪声进行不确定性软件学习 (Uncertainty-Aware Learning Against Label Noise on Imbalanced Datasets) - 专知论文

会员服务 ·

0

噪声 · Performer · Learning · 标注 · MoDELS ·

2022 年 7 月 12 日

Uncertainty-Aware Learning Against Label Noise on Imbalanced Datasets

翻译：在不平衡的数据集上对标签噪声进行不确定性软件学习

Yingsong Huang,Bing Bai,Shengwei Zhao,Kun Bai,Fei Wang

Learning against label noise is a vital topic to guarantee a reliable performance for deep neural networks. Recent research usually refers to dynamic noise modeling with model output probabilities and loss values, and then separates clean and noisy samples. These methods have gained notable success. However, unlike cherry-picked data, existing approaches often cannot perform well when facing imbalanced datasets, a common scenario in the real world. We thoroughly investigate this phenomenon and point out two major issues that hinder the performance, i.e., \emph{inter-class loss distribution discrepancy} and \emph{misleading predictions due to uncertainty}. The first issue is that existing methods often perform class-agnostic noise modeling. However, loss distributions show a significant discrepancy among classes under class imbalance, and class-agnostic noise modeling can easily get confused with noisy samples and samples in minority classes. The second issue refers to that models may output misleading predictions due to epistemic uncertainty and aleatoric uncertainty, thus existing methods that rely solely on the output probabilities may fail to distinguish confident samples. Inspired by our observations, we propose an Uncertainty-aware Label Correction framework~(ULC) to handle label noise on imbalanced datasets. First, we perform epistemic uncertainty-aware class-specific noise modeling to identify trustworthy clean samples and refine/discard highly confident true/corrupted labels. Then, we introduce aleatoric uncertainty in the subsequent learning process to prevent noise accumulation in the label noise modeling process. We conduct experiments on several synthetic and real-world datasets. The results demonstrate the effectiveness of the proposed method, especially on imbalanced datasets.

翻译：针对标签噪音的学习是保证深层神经网络可靠性能的一个重要议题。最近的研究通常是指动态噪音模型,以模型产出的不确定性概率和损失值来进行动态噪音模型,然后将清洁和吵闹的样本分离出来。这些方法取得了显著的成功。然而,与樱桃所选数据不同,现有方法在面对不平衡的数据集时往往无法很好地发挥作用,这是现实世界中常见的场景。我们彻底调查了这一现象,并指出了妨碍性能的两大问题,即:噪音的累积性(emph{跨类损失分布差异 ) 和 memph{误差预测由于不确定性而导致的不确定性。第一个问题是,现有方法往往使用模型的不确定性模型和误差的预测值。现有的方法可能无法辨别可靠的样本。然而,在我们的观察中,损失分布显示阶级不平衡的类别之间存在很大的差异,而等级噪音模型的模型则很容易被混杂地混淆。

0

相关内容

50+篇《神经架构搜索NAS》2020论文合集

专知会员服务

61+阅读 · 2020年3月19日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Tutorial

【ICIG2021】Latest News & Announcements of the Tutorial

中国图象图形学学会CSIG

3+阅读 · 2021年12月20日

【ICIG2021】Latest News & Announcements of the Plenary Talk2

【ICIG2021】Latest News & Announcements of the Plenary Talk2

中国图象图形学学会CSIG

0+阅读 · 2021年11月2日

Multi-Task Learning的几篇综述文章

Multi-Task Learning的几篇综述文章

深度学习自然语言处理

15+阅读 · 2020年6月15日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

REGγ在多发性骨髓瘤中的作用及分子机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

CuO(Cu2O)-ZnO-Ag纳米线中的等离激元能量转移增强光电转换研究

国家自然科学基金

0+阅读 · 2013年12月31日

多源遥感数据地表BRDF/反照率联合反演方法及试验验证

国家自然科学基金

0+阅读 · 2012年12月31日

界面调制对铁磁金属自旋注入效应的影响

国家自然科学基金

0+阅读 · 2012年12月31日

离子注入制备BiFeO3/ZnO/graphene多铁性器件

国家自然科学基金

0+阅读 · 2012年12月31日

渗流型铁电铁磁复相高性能吸波材料的制备与性能研究

国家自然科学基金

0+阅读 · 2012年12月31日

超冷极性分子纠缠态的制备及调控

国家自然科学基金

0+阅读 · 2012年12月31日

PTPMeg2调控STAT3活性的分子机制研究

国家自然科学基金

0+阅读 · 2011年12月31日

miR-124和miR-27对阿尔茨海默病BACE1基因影响的分子机制

国家自然科学基金

0+阅读 · 2011年12月31日

大视场双螺旋锥束CT扫描与重建

国家自然科学基金

0+阅读 · 2009年12月31日

Joint Debiased Representation Learning and Imbalanced Data Clustering

Arxiv

0+阅读 · 2022年9月6日

A Robust Learning Methodology for Uncertainty-aware Scientific Machine Learning models

Arxiv

0+阅读 · 2022年9月5日

ScaleFace: Uncertainty-aware Deep Metric Learning

Arxiv

0+阅读 · 2022年9月5日

Data Provenance via Differential Auditing

Arxiv

1+阅读 · 2022年9月4日

Uncertainty Sets for Image Classifiers using Conformal Prediction

Arxiv

0+阅读 · 2022年9月3日

Unsupervised Joint Image Transfer and Uncertainty Quantification Using Patch Invariant Networks

Arxiv

0+阅读 · 2022年9月2日

Balanced Multimodal Learning via On-the-fly Gradient Modulation

Arxiv

13+阅读 · 2022年3月29日

Active Learning for Domain Adaptation: An Energy-based Approach

Arxiv

13+阅读 · 2021年12月2日

Contrastive learning of global and local features for medical image segmentation with limited annotations

Arxiv

19+阅读 · 2020年6月18日

Class-Balanced Loss Based on Effective Number of Samples

Arxiv

12+阅读 · 2019年1月16日

VIP会员

文章信息

相关主题

相关VIP内容

50+篇《神经架构搜索NAS》2020论文合集

专知会员服务

61+阅读 · 2020年3月19日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

[综述]深度学习下的场景文本检测与识别

[综述]深度学习下的场景文本检测与识别

专知会员服务

78+阅读 · 2019年10月10日

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

【人工智能在2019：一年回顾】反人工智能，AI in 2019: A Year in Review

专知会员服务

79+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

面向性能、成本效益、云边隐私与可信性的大小语言模型协作综述

乌克兰太空研究（2022-2024年） | 176页

【CMU博士论文】大型语言模型的隐性特性

国防领域人工智能走向何方？

相关资讯

ACM MM 2022 Call for Papers

ACM MM 2022 Call for Papers

CCF多媒体专委会

5+阅读 · 2022年3月29日

AIART 2022 Call for Papers

AIART 2022 Call for Papers

CCF多媒体专委会

1+阅读 · 2022年2月13日

【ICIG2021】Latest News & Announcements of the Tutorial

【ICIG2021】Latest News & Announcements of the Tutorial

中国图象图形学学会CSIG

3+阅读 · 2021年12月20日

【ICIG2021】Latest News & Announcements of the Plenary Talk2

【ICIG2021】Latest News & Announcements of the Plenary Talk2

中国图象图形学学会CSIG

0+阅读 · 2021年11月2日

Multi-Task Learning的几篇综述文章

Multi-Task Learning的几篇综述文章

深度学习自然语言处理

15+阅读 · 2020年6月15日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

相关论文

Joint Debiased Representation Learning and Imbalanced Data Clustering

Arxiv

0+阅读 · 2022年9月6日

A Robust Learning Methodology for Uncertainty-aware Scientific Machine Learning models

Arxiv

0+阅读 · 2022年9月5日

ScaleFace: Uncertainty-aware Deep Metric Learning

Arxiv

0+阅读 · 2022年9月5日

Data Provenance via Differential Auditing

Arxiv

1+阅读 · 2022年9月4日

Uncertainty Sets for Image Classifiers using Conformal Prediction

Arxiv

0+阅读 · 2022年9月3日

Unsupervised Joint Image Transfer and Uncertainty Quantification Using Patch Invariant Networks

Arxiv

0+阅读 · 2022年9月2日

Balanced Multimodal Learning via On-the-fly Gradient Modulation

Arxiv

13+阅读 · 2022年3月29日

Active Learning for Domain Adaptation: An Energy-based Approach

Arxiv

13+阅读 · 2021年12月2日

Contrastive learning of global and local features for medical image segmentation with limited annotations

Arxiv

19+阅读 · 2020年6月18日

Class-Balanced Loss Based on Effective Number of Samples

Arxiv

12+阅读 · 2019年1月16日

相关基金

REGγ在多发性骨髓瘤中的作用及分子机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

CuO(Cu2O)-ZnO-Ag纳米线中的等离激元能量转移增强光电转换研究

国家自然科学基金

0+阅读 · 2013年12月31日

多源遥感数据地表BRDF/反照率联合反演方法及试验验证

国家自然科学基金

0+阅读 · 2012年12月31日

界面调制对铁磁金属自旋注入效应的影响

国家自然科学基金

0+阅读 · 2012年12月31日

离子注入制备BiFeO3/ZnO/graphene多铁性器件

国家自然科学基金

0+阅读 · 2012年12月31日

渗流型铁电铁磁复相高性能吸波材料的制备与性能研究

国家自然科学基金

0+阅读 · 2012年12月31日

超冷极性分子纠缠态的制备及调控

国家自然科学基金

0+阅读 · 2012年12月31日

PTPMeg2调控STAT3活性的分子机制研究

国家自然科学基金

0+阅读 · 2011年12月31日

miR-124和miR-27对阿尔茨海默病BACE1基因影响的分子机制

国家自然科学基金

0+阅读 · 2011年12月31日

大视场双螺旋锥束CT扫描与重建

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员