We show that the effectiveness of the celebrated Mixup [Zhang et al., 2018] can be further improved if, instead of using it as the sole learning objective, it is employed as an additional regularizer alongside the standard cross-entropy loss. This simple change not only yields substantially improved accuracy but also, in most cases, significantly improves the quality of Mixup's predictive uncertainty estimates under various forms of covariate shift and in out-of-distribution detection experiments. In fact, we observe that Mixup performs considerably worse at detecting out-of-distribution samples, possibly because, as we show empirically, it tends to learn models that exhibit high entropy throughout, making it difficult to differentiate in-distribution samples from out-of-distribution ones. To demonstrate the efficacy of our approach (RegMixup), we provide thorough analyses and experiments on vision datasets (ImageNet and CIFAR-10/100) and compare it with a suite of recent approaches for reliable uncertainty estimation.
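The core idea above can be sketched as a combined training objective: the standard cross-entropy on the clean batch plus a Mixup cross-entropy term acting as a regularizer. The sketch below is a minimal NumPy illustration, not the paper's implementation; the function names, the regularization weight `eta`, and the Beta-distribution parameter `alpha` are illustrative placeholders (the paper's exact hyperparameter choices may differ).

```python
import numpy as np

def cross_entropy(logits, target_probs):
    # Softmax cross-entropy against (possibly soft) target distributions.
    logits = logits - logits.max(axis=-1, keepdims=True)
    log_probs = logits - np.log(np.exp(logits).sum(axis=-1, keepdims=True))
    return -(target_probs * log_probs).sum(axis=-1).mean()

def regmixup_loss(model, x, y_onehot, alpha=10.0, eta=1.0, rng=None):
    """Clean CE plus a Mixup CE term used as a regularizer.

    `model` maps inputs to logits. `eta` weights the Mixup term;
    both names are illustrative, not the paper's exact API.
    """
    rng = rng or np.random.default_rng(0)
    lam = rng.beta(alpha, alpha)            # mixing coefficient
    perm = rng.permutation(len(x))          # pair each sample with another
    x_mix = lam * x + (1 - lam) * x[perm]   # convex combination of inputs
    y_mix = lam * y_onehot + (1 - lam) * y_onehot[perm]  # and of labels
    clean = cross_entropy(model(x), y_onehot)   # standard CE term
    mixed = cross_entropy(model(x_mix), y_mix)  # Mixup regularizer term
    return clean + eta * mixed
```

Note the difference from vanilla Mixup: the clean-batch cross-entropy term is always present, and the interpolated batch only contributes an additional penalty, so the model is still explicitly trained on unmixed in-distribution inputs.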