The ability to estimate epistemic uncertainty is often crucial when deploying machine learning in the real world, but modern methods tend to produce overconfident, uncalibrated uncertainty predictions. A common approach to quantify epistemic uncertainty, usable across a wide class of prediction models, is to train a model ensemble. In a naive implementation, the ensemble approach has high computational cost and high memory demand. This is particularly challenging for modern deep learning, where even a single deep network is already demanding in terms of compute and memory, and it has given rise to a number of attempts to emulate a model ensemble without actually instantiating separate ensemble members. We introduce FiLM-Ensemble, a deep, implicit ensemble method based on the concept of Feature-wise Linear Modulation (FiLM). That technique was originally developed for multi-task learning, with the aim of decoupling different tasks. We show that the idea can be extended to uncertainty quantification: by modulating the network activations of a single deep network with FiLM, one obtains a model ensemble with high diversity, and consequently well-calibrated estimates of epistemic uncertainty, at low computational overhead. Empirically, FiLM-Ensemble outperforms other implicit ensemble methods, and comes very close to the upper bound of an explicit ensemble of networks (sometimes even beating it), at a fraction of the memory cost.
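To make the core mechanism concrete, the following is a minimal sketch of how per-member FiLM parameters can modulate the activations of a single shared backbone, so that one forward pass over a replicated batch yields predictions for all implicit ensemble members. This is an illustrative sketch, not the authors' released implementation; the class name FiLMEnsembleLayer, the initialization scale, and the batch-replication convention are assumptions made here for illustration.

```python
import torch
import torch.nn as nn


class FiLMEnsembleLayer(nn.Module):
    """Sketch of a FiLM-Ensemble layer (hypothetical, for illustration).

    Each of n_members implicit ensemble members owns its own channel-wise
    affine parameters (gamma_m, beta_m). The batch is replicated across
    members, and member m's activations are modulated as
    y = gamma_m * x + beta_m, i.e. Feature-wise Linear Modulation.
    """

    def __init__(self, n_members: int, n_channels: int):
        super().__init__()
        # Assumed initialization: gammas near 1, betas near 0, with random
        # perturbations so the members start out diverse.
        self.gamma = nn.Parameter(1.0 + 0.5 * torch.randn(n_members, n_channels))
        self.beta = nn.Parameter(0.5 * torch.randn(n_members, n_channels))
        self.n_members = n_members

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (n_members * batch, channels, H, W); member index varies slowest.
        b = x.shape[0] // self.n_members
        # Repeat each member's parameters b times to align with the batch.
        gamma = self.gamma.repeat_interleave(b, dim=0)[:, :, None, None]
        beta = self.beta.repeat_interleave(b, dim=0)[:, :, None, None]
        return gamma * x + beta


# Usage: replicate one batch across 4 members, then separate their outputs.
layer = FiLMEnsembleLayer(n_members=4, n_channels=16)
x = torch.randn(8, 16, 32, 32)      # an ordinary batch
x_rep = x.repeat(4, 1, 1, 1)        # member 0's batch, then member 1's, ...
out = layer(x_rep)                  # (32, 16, 32, 32)
per_member = out.view(4, 8, 16, 32, 32)  # average these for the ensemble prediction
```

Since only the small per-channel vectors gamma and beta are member-specific while all convolutional weights are shared, this construction keeps the memory overhead of the ensemble low, consistent with the claim above.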