转移知识从头到尾：长尾分布下的不确定性校准 (Transfer Knowledge from Head to Tail: Uncertainty Calibration under Long-tailed Distribution) - 专知论文

会员服务 ·

0

长尾分布 · 类别 · 不确定 · 不确定性 · CIFAR-10 ·

2023 年 4 月 13 日

Transfer Knowledge from Head to Tail: Uncertainty Calibration under Long-tailed Distribution

翻译：转移知识从头到尾：长尾分布下的不确定性校准

Jiahao Chen,Bing Su

How to estimate the uncertainty of a given model is a crucial problem. Current calibration techniques treat different classes equally and thus implicitly assume that the distribution of training data is balanced, but ignore the fact that real-world data often follows a long-tailed distribution. In this paper, we explore the problem of calibrating the model trained from a long-tailed distribution. Due to the difference between the imbalanced training distribution and balanced test distribution, existing calibration methods such as temperature scaling can not generalize well to this problem. Specific calibration methods for domain adaptation are also not applicable because they rely on unlabeled target domain instances which are not available. Models trained from a long-tailed distribution tend to be more overconfident to head classes. To this end, we propose a novel knowledge-transferring-based calibration method by estimating the importance weights for samples of tail classes to realize long-tailed calibration. Our method models the distribution of each class as a Gaussian distribution and views the source statistics of head classes as a prior to calibrate the target distributions of tail classes. We adaptively transfer knowledge from head classes to get the target probability density of tail classes. The importance weight is estimated by the ratio of the target probability density over the source probability density. Extensive experiments on CIFAR-10-LT, MNIST-LT, CIFAR-100-LT, and ImageNet-LT datasets demonstrate the effectiveness of our method.

翻译：如何估计给定模型的不确定性是一个关键问题。当前的校准技术平等地处理不同的类别，并暗含着训练数据的分布是平衡的，但忽略了现实世界中数据往往服从长尾分布的事实。在这篇文章中，我们探讨了调整长尾分布模型的校准问题。由于不平衡的训练分布和平衡的测试分布之间的差别，现有的校准方法（如温度缩放）无法很好地推广到这个问题。特定的领域适应校准方法也不适用，因为它们依赖于不可用的目标域实例。从长尾分布中训练的模型往往对头部类别更过度自信。因此，我们提出了一种新颖的基于知识转移的校准方法，通过估计尾部类别样本的重要性权重来实现长尾校准。我们将每个类别的分布建模为高斯分布，并将头部类别的源统计信息视为先验来校准尾部类别的目标分布。我们适应性地从头部类别中传递知识来获取尾部类别的目标概率密度。通过目标概率密度与源概率密度的比值来估计重要性权重。CIFAR-10-LT、MNIST-LT、CIFAR-100-LT和ImageNet-LT数据集上的广泛实验证明了我们方法的有效性。

0

相关内容

长尾分布

【CVPR 2022】长尾视觉数据识别的嵌套式协同学习方法 Nested Collaborative Learning for Long-Tailed Visual Recognition

【CVPR 2022】长尾视觉数据识别的嵌套式协同学习方法 Nested Collaborative Learning for Long-Tailed Visual Recognition

专知会员服务

13+阅读 · 2022年3月19日

最新《计算机视觉领域泛化Domain Generalization》综述论文，18页pdf229篇文献

专知会员服务

57+阅读 · 2021年7月27日

【MIT】自监督几何感知，22页ppt，Self-supervised Geometric Perception

【MIT】自监督几何感知，22页ppt，Self-supervised Geometric Perception

专知会员服务

23+阅读 · 2021年6月3日

NeurIPS 2020最佳论文奖项出炉！GPT-3、伯克利等3篇论文摘得！

NeurIPS 2020最佳论文奖项出炉！GPT-3、伯克利等3篇论文摘得！

专知会员服务

11+阅读 · 2020年12月8日

近期必读的六篇 ICML 2020【因果推理】相关论文

近期必读的六篇 ICML 2020【因果推理】相关论文

专知会员服务

88+阅读 · 2020年9月8日

【异构图迁移的零样本学习】Heterogeneous Graph-based Knowledge Transfer for Generalized Zero-shot Learning

【异构图迁移的零样本学习】Heterogeneous Graph-based Knowledge Transfer for Generalized Zero-shot Learning

专知会员服务

66+阅读 · 2020年4月17日

【浙江大学-AAAI2020】领域自适应的对抗损失，Adversarial-Learned Loss for Domain Adaptation

【浙江大学-AAAI2020】领域自适应的对抗损失，Adversarial-Learned Loss for Domain Adaptation

专知会员服务

62+阅读 · 2020年1月11日

【NeurIPS2019】模仿学习中的因果混乱问题 Causal Confusion in Imitation Learning

【NeurIPS2019】模仿学习中的因果混乱问题 Causal Confusion in Imitation Learning

专知会员服务

30+阅读 · 2019年12月10日

【CoRL2019最佳论文】模仿学习，A Divergence Minimization Perspective on Imitation Learning Methods

【CoRL2019最佳论文】模仿学习，A Divergence Minimization Perspective on Imitation Learning Methods

专知会员服务

24+阅读 · 2019年11月11日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

EMNLP 2022 | 校准预训练模型中的事实知识

EMNLP 2022 | 校准预训练模型中的事实知识

PaperWeekly

1+阅读 · 2022年11月22日

ACML 2022｜三行代码解决长尾不平衡类别分类

ACML 2022｜三行代码解决长尾不平衡类别分类

极市平台

2+阅读 · 2022年11月3日

ICML 2022 | 基于有偏不对称对比学习的长尾分布外检测

ICML 2022 | 基于有偏不对称对比学习的长尾分布外检测

PaperWeekly

0+阅读 · 2022年9月20日

ACL 2022 | 分解的元学习小样本命名实体识别

ACL 2022 | 分解的元学习小样本命名实体识别

PaperWeekly

1+阅读 · 2022年6月30日

深度学习模型不确定性方法对比

深度学习模型不确定性方法对比

PaperWeekly

20+阅读 · 2020年2月10日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

不确定分数阶非线性系统Mittag-Leffler自适应控制

国家自然科学基金

1+阅读 · 2016年12月31日

Nr5a2/SALL4维持胃癌干细胞自我更新能力的分子机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

三维椭圆方程Cauchy问题的正则化方法

国家自然科学基金

0+阅读 · 2013年12月31日

p-MSK1 (Thr581)影响结直肠癌预后的机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

不确定性推理与语义网中知识表示的数学基础

国家自然科学基金

18+阅读 · 2012年12月31日

胃癌17q21.33缺失区内候选基因TOB1功能及其失活机制的研究

国家自然科学基金

0+阅读 · 2012年12月31日

似然方法的有限样本研究

国家自然科学基金

0+阅读 · 2011年12月31日

批次过程数据模量驱动的分布中心匹配故障诊断研究

国家自然科学基金

0+阅读 · 2011年12月31日

小麦抗旱相关ERF转录因子介导的信号传递网络解析

国家自然科学基金

0+阅读 · 2011年12月31日

对偶自适应控制问题研究

国家自然科学基金

0+阅读 · 2008年12月31日

Prediction under hypothetical interventions: evaluation of performance using longitudinal observational data

Arxiv

0+阅读 · 2023年5月31日

Hypothesis Transfer Learning with Surrogate Classification Losses

Arxiv

0+阅读 · 2023年5月31日

Detecting hidden confounding in observational data using multiple environments

Arxiv

0+阅读 · 2023年5月29日

Ensemble Multi-Quantiles: Adaptively Flexible Distribution Prediction for Uncertainty Quantification

Arxiv

0+阅读 · 2023年5月29日

CREST: A Joint Framework for Rationalization and Counterfactual Text Generation

Arxiv

0+阅读 · 2023年5月26日

Stochastic metrology and the empirical distribution

Arxiv

0+阅读 · 2023年5月25日

Prompt Distribution Learning

Arxiv

14+阅读 · 2022年5月6日

A Survey of Uncertainty in Deep Neural Networks

Arxiv

30+阅读 · 2021年7月7日

The Causal Learning of Retail Delinquency

Arxiv

14+阅读 · 2020年12月17日

Meta-Learning to Cluster

Meta-Learning to Cluster

Arxiv

17+阅读 · 2019年10月30日

VIP会员

文章信息

相关主题

相关VIP内容

【CVPR 2022】长尾视觉数据识别的嵌套式协同学习方法 Nested Collaborative Learning for Long-Tailed Visual Recognition

【CVPR 2022】长尾视觉数据识别的嵌套式协同学习方法 Nested Collaborative Learning for Long-Tailed Visual Recognition

专知会员服务

13+阅读 · 2022年3月19日

最新《计算机视觉领域泛化Domain Generalization》综述论文，18页pdf229篇文献

专知会员服务

57+阅读 · 2021年7月27日

【MIT】自监督几何感知，22页ppt，Self-supervised Geometric Perception

【MIT】自监督几何感知，22页ppt，Self-supervised Geometric Perception

专知会员服务

23+阅读 · 2021年6月3日

NeurIPS 2020最佳论文奖项出炉！GPT-3、伯克利等3篇论文摘得！

NeurIPS 2020最佳论文奖项出炉！GPT-3、伯克利等3篇论文摘得！

专知会员服务

11+阅读 · 2020年12月8日

近期必读的六篇 ICML 2020【因果推理】相关论文

近期必读的六篇 ICML 2020【因果推理】相关论文

专知会员服务

88+阅读 · 2020年9月8日

【异构图迁移的零样本学习】Heterogeneous Graph-based Knowledge Transfer for Generalized Zero-shot Learning

【异构图迁移的零样本学习】Heterogeneous Graph-based Knowledge Transfer for Generalized Zero-shot Learning

专知会员服务

66+阅读 · 2020年4月17日

【浙江大学-AAAI2020】领域自适应的对抗损失，Adversarial-Learned Loss for Domain Adaptation

【浙江大学-AAAI2020】领域自适应的对抗损失，Adversarial-Learned Loss for Domain Adaptation

专知会员服务

62+阅读 · 2020年1月11日

【NeurIPS2019】模仿学习中的因果混乱问题 Causal Confusion in Imitation Learning

【NeurIPS2019】模仿学习中的因果混乱问题 Causal Confusion in Imitation Learning

专知会员服务

30+阅读 · 2019年12月10日

【CoRL2019最佳论文】模仿学习，A Divergence Minimization Perspective on Imitation Learning Methods

【CoRL2019最佳论文】模仿学习，A Divergence Minimization Perspective on Imitation Learning Methods

专知会员服务

24+阅读 · 2019年11月11日

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

【加州大学伯克利分校博士论文】通过自我监督预测学习泛化

专知会员服务

65+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

【CMU博士论文】数据驱动决策中的激励、信息与不确定性

DGP双粒度提示框架：图增强大模型助力欺诈检测

【ICCV2025】ESSENTIAL：用于视频类增量学习的情景记忆与语义记忆整合

唯快不破：大型语言模型高效架构综述

相关资讯

EMNLP 2022 | 校准预训练模型中的事实知识

EMNLP 2022 | 校准预训练模型中的事实知识

PaperWeekly

1+阅读 · 2022年11月22日

ACML 2022｜三行代码解决长尾不平衡类别分类

ACML 2022｜三行代码解决长尾不平衡类别分类

极市平台

2+阅读 · 2022年11月3日

ICML 2022 | 基于有偏不对称对比学习的长尾分布外检测

ICML 2022 | 基于有偏不对称对比学习的长尾分布外检测

PaperWeekly

0+阅读 · 2022年9月20日

ACL 2022 | 分解的元学习小样本命名实体识别

ACL 2022 | 分解的元学习小样本命名实体识别

PaperWeekly

1+阅读 · 2022年6月30日

深度学习模型不确定性方法对比

深度学习模型不确定性方法对比

PaperWeekly

20+阅读 · 2020年2月10日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

相关论文

Prediction under hypothetical interventions: evaluation of performance using longitudinal observational data

Arxiv

0+阅读 · 2023年5月31日

Hypothesis Transfer Learning with Surrogate Classification Losses

Arxiv

0+阅读 · 2023年5月31日

Detecting hidden confounding in observational data using multiple environments

Arxiv

0+阅读 · 2023年5月29日

Ensemble Multi-Quantiles: Adaptively Flexible Distribution Prediction for Uncertainty Quantification

Arxiv

0+阅读 · 2023年5月29日

CREST: A Joint Framework for Rationalization and Counterfactual Text Generation

Arxiv

0+阅读 · 2023年5月26日

Stochastic metrology and the empirical distribution

Arxiv

0+阅读 · 2023年5月25日

Prompt Distribution Learning

Arxiv

14+阅读 · 2022年5月6日

A Survey of Uncertainty in Deep Neural Networks

Arxiv

30+阅读 · 2021年7月7日

The Causal Learning of Retail Delinquency

Arxiv

14+阅读 · 2020年12月17日

Meta-Learning to Cluster

Meta-Learning to Cluster

Arxiv

17+阅读 · 2019年10月30日

相关基金

不确定分数阶非线性系统Mittag-Leffler自适应控制

国家自然科学基金

1+阅读 · 2016年12月31日

Nr5a2/SALL4维持胃癌干细胞自我更新能力的分子机制研究

国家自然科学基金

0+阅读 · 2013年12月31日

三维椭圆方程Cauchy问题的正则化方法

国家自然科学基金

0+阅读 · 2013年12月31日

p-MSK1 (Thr581)影响结直肠癌预后的机制研究

国家自然科学基金

0+阅读 · 2012年12月31日

不确定性推理与语义网中知识表示的数学基础

国家自然科学基金

18+阅读 · 2012年12月31日

胃癌17q21.33缺失区内候选基因TOB1功能及其失活机制的研究

国家自然科学基金

0+阅读 · 2012年12月31日

似然方法的有限样本研究

国家自然科学基金

0+阅读 · 2011年12月31日

批次过程数据模量驱动的分布中心匹配故障诊断研究

国家自然科学基金

0+阅读 · 2011年12月31日

小麦抗旱相关ERF转录因子介导的信号传递网络解析

国家自然科学基金

0+阅读 · 2011年12月31日

对偶自适应控制问题研究

国家自然科学基金

0+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员