This paper aims to provide an understanding of the effect of an over-parameterized model, e.g., a deep neural network, memorizing instance-dependent noisy labels. We first quantify the harm caused by memorizing noisy instances and show the disparate impact of noisy labels on instances with different representation frequencies. We then analyze how several popular solutions for learning with noisy labels mitigate this harm at the instance level. Our analysis reveals that existing approaches treat noisy instances disparately: while higher-frequency instances often enjoy a high probability of improvement when these solutions are applied, lower-frequency instances do not. Our analysis offers new insight into when these approaches work and provides theoretical justifications for previously reported empirical observations. These findings call for rethinking the distribution of label noise across instances and for different treatments of instances in different regimes.