Mixtures of experts (MoE) are a popular class of statistical and machine learning models that have gained attention over the years for their flexibility and efficiency. In this work, we consider Gaussian-gated localized MoE (GLoME) and block-diagonal covariance localized MoE (BLoME) regression models to capture nonlinear relationships in heterogeneous data with potential hidden graph-structured interactions between high-dimensional predictors. These models raise difficult statistical estimation and model selection questions, from both computational and theoretical perspectives. This paper is devoted to the problem of model selection among a collection of GLoME or BLoME models characterized by the number of mixture components, the complexity of the Gaussian mean experts, and the hidden block-diagonal structures of the covariance matrices, in a penalized maximum likelihood estimation framework. In particular, we establish non-asymptotic risk bounds that take the form of weak oracle inequalities, provided that lower bounds on the penalties hold. The good empirical behavior of our models is then demonstrated on synthetic and real datasets.
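To fix ideas, the penalized model selection step and the resulting oracle-type guarantee can be written schematically as follows; the notation here is illustrative rather than taken from the paper ($s_0$ denotes the true conditional density, $S_m$ the model indexed by $m$ in the collection $\mathcal{M}$, $\widehat{s}_m$ its penalized maximum likelihood estimator, and $d$ a suitable divergence, e.g., of Jensen--Kullback--Leibler type). The selected model minimizes a penalized negative log-likelihood,
\[
\widehat{m} \in \operatorname*{arg\,min}_{m \in \mathcal{M}} \left\{ -\frac{1}{n} \sum_{i=1}^{n} \log \widehat{s}_m(Y_i \mid X_i) + \operatorname{pen}(m) \right\},
\]
and, provided $\operatorname{pen}(m)$ is bounded below by a quantity proportional to the complexity of $S_m$ divided by $n$, the risk of the selected estimator is controlled by the best penalized trade-off over the collection,
\[
\mathbb{E}\!\left[ d\!\left(s_0, \widehat{s}_{\widehat{m}}\right) \right] \le C \inf_{m \in \mathcal{M}} \left\{ \inf_{s_m \in S_m} d(s_0, s_m) + \operatorname{pen}(m) \right\} + \frac{C'}{n}.
\]
The inequality is called weak because the divergence on the left-hand side may differ from the one measuring the approximation bias on the right-hand side.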