Classification and Uncertainty Quantification of Corrupted Data using Semi-Supervised Autoencoders - 专知论文 (Zhuanzhi Papers)


Inference · Autoencoder · Uncertainty · Uncertainty Quantification

April 20, 2023

Classification and Uncertainty Quantification of Corrupted Data using Semi-Supervised Autoencoders


Philipp Joppich, Sebastian Dorn, Oliver De Candido, Wolfgang Utschick, Jakob Knollmüller

Parametric and non-parametric classifiers often have to deal with real-world data, where corruptions like noise, occlusions, and blur are unavoidable, which poses significant challenges. We present a probabilistic approach to classify strongly corrupted data and quantify uncertainty, despite the model only having been trained with uncorrupted data. A semi-supervised autoencoder trained on uncorrupted data is the underlying architecture. We use the decoding part as a generative model for realistic data and extend it by convolutions, masking, and additive Gaussian noise to describe imperfections. This constitutes a statistical inference task in terms of the optimal latent space activations of the underlying uncorrupted datum. We solve this problem approximately with Metric Gaussian Variational Inference (MGVI). The supervision of the autoencoder's latent space allows us to classify corrupted data directly under uncertainty with the statistically inferred latent space activations. Furthermore, we demonstrate that the model uncertainty strongly depends on whether the classification is correct or wrong, setting a basis for a statistical "lie detector" of the classification. Independent of that, we show that the generative model can optimally restore the uncorrupted datum by decoding the inferred latent space activations.
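The corruption model described in the abstract (decoder output, then convolution, masking, and additive Gaussian noise) can be illustrated with a small toy sketch. This is not the authors' implementation and it does not use MGVI: as a simplifying assumption, the trained decoder is replaced by a fixed random linear map `W`, so that the posterior over the latent activations is exactly Gaussian and can be written in closed form. All names, sizes, the blur kernel, the mask, and the noise level are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)

n_pix, n_lat = 64, 4   # hypothetical sizes for a 1-D "image" and its latent space
sigma = 0.05           # assumed standard deviation of the additive Gaussian noise

# Stand-in for the trained decoder: a fixed linear map (an assumption, not the paper's network).
W = rng.normal(size=(n_pix, n_lat))

# Blur as a circular convolution, written as an explicit matrix.
kernel = np.zeros(n_pix)
kernel[[0, 1, -1]] = [0.5, 0.25, 0.25]
K = np.stack([np.roll(kernel, i) for i in range(n_pix)], axis=1)

# Occlusion mask: a contiguous block of pixels is not observed.
mask = np.ones(n_pix)
mask[20:35] = 0.0
M = np.diag(mask)

# Full linear response from latent activations to corrupted data.
A = M @ K @ W

# Simulate one corrupted datum from a known latent vector.
z_true = rng.normal(size=n_lat)
d = A @ z_true + sigma * rng.normal(size=n_pix)

# Exact Gaussian posterior for a standard-normal prior on z
# (closed form only because the stand-in decoder is linear).
post_cov = np.linalg.inv(np.eye(n_lat) + A.T @ A / sigma**2)
post_mean = post_cov @ A.T @ d / sigma**2
post_std = np.sqrt(np.diag(post_cov))

# Decode the posterior mean to restore the uncorrupted datum.
restored = W @ post_mean

print("true latent:     ", np.round(z_true, 3))
print("posterior mean:  ", np.round(post_mean, 3))
print("posterior std:   ", np.round(post_std, 3))
```

With a nonlinear decoder the posterior is no longer Gaussian, which is where an approximate scheme such as MGVI comes in. In the paper's setting, some latent dimensions are supervised with class labels, so the inferred latent posterior also carries the classification and its uncertainty; the toy above only shows the inference and restoration steps.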


