Tackling Instance-Dependent Label Noise with Dynamic Distribution Calibration - Zhuanzhi Papers


Noise · Annotation · Multivariate Gaussian distribution · Gaussian distribution · Robustness

October 11, 2022

Tackling Instance-Dependent Label Noise with Dynamic Distribution Calibration


Manyi Zhang, Yuxin Ren, Zihao Wang, Chun Yuan

From arXiv; accepted at ACM MM 2022

Instance-dependent label noise is realistic but rather challenging, where the label-corruption process depends on instances directly. It causes a severe distribution shift between the distributions of training and test data, which impairs the generalization of trained models. Prior works put great effort into tackling the issue. Unfortunately, these works always highly rely on strong assumptions or remain heuristic without theoretical guarantees. In this paper, to address the distribution shift in learning with instance-dependent label noise, a dynamic distribution-calibration strategy is adopted. Specifically, we hypothesize that, before training data are corrupted by label noise, each class conforms to a multivariate Gaussian distribution at the feature level. Label noise produces outliers to shift the Gaussian distribution. During training, to calibrate the shifted distribution, we propose two methods based on the mean and covariance of multivariate Gaussian distribution respectively. The mean-based method works in a recursive dimension-reduction manner for robust mean estimation, which is theoretically guaranteed to train a high-quality model against label noise. The covariance-based method works in a distribution disturbance manner, which is experimentally verified to improve the model robustness. We demonstrate the utility and effectiveness of our methods on datasets with synthetic label noise and real-world unknown noise.
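To make the distribution-shift argument concrete, the following is a minimal sketch of how mislabeled samples pull a class's estimated feature mean away from its clean value, and how a robust estimate can pull it back. It is not the paper's recursive dimension-reduction method: a much simpler trimmed mean is swapped in purely for illustration. The feature dimension, the 20% noise rate, the diagonal covariance, and every function name here are assumptions introduced for this example.

```python
import random
import statistics


def sample_gaussian(mean, std, n, rng):
    """Draw n points from a Gaussian with diagonal covariance std^2 * I."""
    return [[rng.gauss(m, std) for m in mean] for _ in range(n)]


def naive_mean(points):
    """Plain sample mean, which outliers can shift arbitrarily."""
    dim = len(points[0])
    return [sum(p[d] for p in points) / len(points) for d in range(dim)]


def trimmed_mean(points, trim_fraction):
    """Illustrative robust mean (NOT the paper's estimator).

    Drops the trim_fraction of points farthest from the coordinate-wise
    median, then averages the remaining points.
    """
    dim = len(points[0])
    center = [statistics.median(p[d] for p in points) for d in range(dim)]

    def sq_dist(p):
        return sum((p[d] - center[d]) ** 2 for d in range(dim))

    keep = len(points) - int(trim_fraction * len(points))
    kept = sorted(points, key=sq_dist)[:keep]
    return naive_mean(kept)


def l2_error(a, b):
    """Euclidean distance between two vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b)) ** 0.5


rng = random.Random(0)
true_mean = [0.0, 0.0, 0.0, 0.0]

# Clean features of one class (hypothetical 4-dimensional feature space).
clean = sample_gaussian(true_mean, 1.0, 800, rng)
# Features of another class that were wrongly given this class's label.
mislabeled = sample_gaussian([6.0, 6.0, 6.0, 6.0], 1.0, 200, rng)
features = clean + mislabeled

naive_err = l2_error(naive_mean(features), true_mean)
robust_err = l2_error(trimmed_mean(features, 0.25), true_mean)

print(f"naive mean error:   {naive_err:.3f}")
print(f"trimmed mean error: {robust_err:.3f}")
```

In this toy setting the trimming fraction is set slightly above the assumed noise rate, so the mislabeled points are discarded before averaging. In practice the noise rate is unknown, which is one reason an estimator with theoretical guarantees, such as the one the abstract describes, is preferable to this heuristic.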

