混合专家模式中示范选择的非药物性惩罚性标准 (A non-asymptotic penalization criterion for model selection in mixture of experts models) - 专知论文

会员服务 ·

0

混合专家模型 · 估计/估计量 · MoDELS · 模型选择 · Performer ·

2021 年 4 月 6 日

A non-asymptotic penalization criterion for model selection in mixture of experts models

翻译：混合专家模式中示范选择的非药物性惩罚性标准

TrungTin Nguyen,Hien Duy Nguyen,Faicel Chamroukhi,Florence Forbes

Mixture of experts (MoE) is a popular class of models in statistics and machine learning that has sustained attention over the years, due to its flexibility and effectiveness. We consider the Gaussian-gated localized MoE (GLoME) regression model for modeling heterogeneous data. This model poses challenging questions with respect to the statistical estimation and model selection problems, including feature selection, both from the computational and theoretical points of view. We study the problem of estimating the number of components of the GLoME model, in a penalized maximum likelihood estimation framework. We provide a lower bound on the penalty that ensures a weak oracle inequality is satisfied by our estimator. To support our theoretical result, we perform numerical experiments on simulated and real data, which illustrate the performance of our finite-sample oracle inequality.

翻译：专家混合(MoE)是统计和机器学习方面最受欢迎的模型,多年来因其灵活性和有效性而一直受到关注。我们认为高山化本地化的MOE(GLOME)回归模型用于建模多种数据。这一模型对统计估计和模型选择问题提出了具有挑战性的问题,包括从计算和理论角度选择特征。我们研究了在受处罚的最大可能性估计框架内估算GLOME模型组成部分数量的问题。我们对于确保我们的估算者满足弱骨骼不平等的处罚提供了较低的约束。为了支持我们的理论结果,我们对模拟和真实数据进行了数字实验,这显示了我们有限的标本或标本不平等的表现。

0

相关内容

混合专家模型

混合专家模型

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

专知会员服务

69+阅读 · 2021年3月27日

威斯康辛大学《机器学习导论》2020秋季课程完结，课件、视频资源已开放

专知会员服务

16+阅读 · 2020年12月25日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

【干货书】机器学习速查手册，135页pdf

【干货书】机器学习速查手册，135页pdf

专知会员服务

127+阅读 · 2020年11月20日

知识图谱嵌入模型的概率标定,Probability Calibration for Knowledge Graph Embedding Models

专知会员服务

36+阅读 · 2020年5月11日

【知识图谱嵌入补全综述论文】embedding models for knowledge base completion

【知识图谱嵌入补全综述论文】embedding models for knowledge base completion

专知会员服务

102+阅读 · 2020年4月25日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

【新书】Python编程基础，669页pdf

【新书】Python编程基础，669页pdf

专知会员服务

196+阅读 · 2019年10月10日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

已删除

将门创投

7+阅读 · 2019年10月10日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

Distributional Results for Model-Based Intrinsic Dimension Estimators

Distributional Results for Model-Based Intrinsic Dimension Estimators

Arxiv

0+阅读 · 2021年6月1日

Robust design optimisation of continuous flow polymerase chain reaction thermal flow systems

Robust design optimisation of continuous flow polymerase chain reaction thermal flow systems

Arxiv

0+阅读 · 2021年6月1日

Minimizing Sensitivity to Model Misspecification

Arxiv

0+阅读 · 2021年6月1日

Improved error estimates of hybridizable interior penalty methods using a variable penalty for highly anisotropic diffusion problems

Arxiv

0+阅读 · 2021年6月1日

Near-optimal Local Convergence of Alternating Gradient Descent-Ascent for Minimax Optimization

Arxiv

0+阅读 · 2021年5月31日

Transfer Learning under High-dimensional Generalized Linear Models

Arxiv

0+阅读 · 2021年5月29日

Influence of sampling on the convergence rates of greedy algorithms for parameter-dependent random variables

Arxiv

0+阅读 · 2021年5月28日

Optimality of Cross-validation in Scattered Data Approximation

Arxiv

0+阅读 · 2021年5月28日

A fast algorithm with minimax optimal guarantees for topic models with an unknown number of topics

Arxiv

7+阅读 · 2018年6月12日

The Search Problem in Mixture Models

Arxiv

3+阅读 · 2018年2月24日

VIP会员

文章信息

相关主题

混合专家模型

估计/估计量

相关VIP内容

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

INRIA最新「机器学习理论」新书，229页pdf原理性阐述机器学习

专知会员服务

69+阅读 · 2021年3月27日

威斯康辛大学《机器学习导论》2020秋季课程完结，课件、视频资源已开放

专知会员服务

16+阅读 · 2020年12月25日

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

【干货书】机器学习速查手册，135页pdf

【干货书】机器学习速查手册，135页pdf

专知会员服务

127+阅读 · 2020年11月20日

知识图谱嵌入模型的概率标定,Probability Calibration for Knowledge Graph Embedding Models

专知会员服务

36+阅读 · 2020年5月11日

【知识图谱嵌入补全综述论文】embedding models for knowledge base completion

【知识图谱嵌入补全综述论文】embedding models for knowledge base completion

专知会员服务

102+阅读 · 2020年4月25日

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

Connections between Support Vector Machines, Wasserstein distance and gradient-penalty GANs

专知会员服务

36+阅读 · 2019年10月17日

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

Deep Learning Based Detection and Correction of Cardiac MR Motion Artefacts During Reconstruction for High-Quality Segmentation

专知会员服务

59+阅读 · 2019年10月17日

【新书】Python编程基础，669页pdf

【新书】Python编程基础，669页pdf

专知会员服务

196+阅读 · 2019年10月10日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

《巡飞弹药（爆炸性无人机）威胁态势分析》最新24页报告

《军用后勤无人机：破解战场运输挑战的创新方案》

人工智能战争：以色列、伊朗与新型AI战争形态

《俄乌战争：现代战争未来的启示与经验》

相关资讯

已删除

将门创投

7+阅读 · 2019年10月10日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

相关论文

Distributional Results for Model-Based Intrinsic Dimension Estimators

Distributional Results for Model-Based Intrinsic Dimension Estimators

Arxiv

0+阅读 · 2021年6月1日

Robust design optimisation of continuous flow polymerase chain reaction thermal flow systems

Robust design optimisation of continuous flow polymerase chain reaction thermal flow systems

Arxiv

0+阅读 · 2021年6月1日

Minimizing Sensitivity to Model Misspecification

Arxiv

0+阅读 · 2021年6月1日

Improved error estimates of hybridizable interior penalty methods using a variable penalty for highly anisotropic diffusion problems

Arxiv

0+阅读 · 2021年6月1日

Near-optimal Local Convergence of Alternating Gradient Descent-Ascent for Minimax Optimization

Arxiv

0+阅读 · 2021年5月31日

Transfer Learning under High-dimensional Generalized Linear Models

Arxiv

0+阅读 · 2021年5月29日

Influence of sampling on the convergence rates of greedy algorithms for parameter-dependent random variables

Arxiv

0+阅读 · 2021年5月28日

Optimality of Cross-validation in Scattered Data Approximation

Arxiv

0+阅读 · 2021年5月28日

A fast algorithm with minimax optimal guarantees for topic models with an unknown number of topics

Arxiv

7+阅读 · 2018年6月12日

The Search Problem in Mixture Models

Arxiv

3+阅读 · 2018年2月24日

微信扫码咨询专知VIP会员