Sharp Analysis of Expectation-Maximization for Weakly Identifiable Models - 专知 (Zhuanzhi) Papers


November 16, 2021

Sharp Analysis of Expectation-Maximization for Weakly Identifiable Models


Raaz Dwivedi, Nhat Ho, Koulik Khamaru, Martin J. Wainwright, Michael I. Jordan, Bin Yu

From arXiv: 30 pages, 4 figures. The first three authors contributed equally to this work. To appear in AISTATS 2020.

We study a class of weakly identifiable location-scale mixture models for which the maximum likelihood estimates based on $n$ i.i.d. samples are known to have lower accuracy than the classical $n^{-\frac{1}{2}}$ error. We investigate whether the Expectation-Maximization (EM) algorithm also converges slowly for these models. We provide a rigorous characterization of EM for fitting a weakly identifiable Gaussian mixture in a univariate setting, where we prove that the EM algorithm converges in order $n^{\frac{3}{4}}$ steps and returns estimates that are at a Euclidean distance of order $n^{-\frac{1}{8}}$ and $n^{-\frac{1}{4}}$ from the true location and scale parameters, respectively. Establishing the slow rates in the univariate setting requires a novel localization argument with two stages, with each stage involving an epoch-based argument applied to a different surrogate EM operator at the population level. We demonstrate several multivariate ($d \geq 2$) examples that exhibit the same slow rates as the univariate case. We also prove slow statistical rates in higher dimensions in a special case, when the fitted covariance is constrained to be a multiple of the identity.
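To make the univariate setting concrete, here is a minimal sketch of EM for one weakly identifiable fit. The abstract does not spell out the model, so the following are assumptions made for illustration: the data are drawn from a single standard Gaussian, and the fitted model is the symmetric two-component mixture $\frac{1}{2}\mathcal{N}(-\theta, \sigma^2) + \frac{1}{2}\mathcal{N}(\theta, \sigma^2)$ with a shared variance. The function name `em_symmetric_mixture` and its parameters are hypothetical. Under these assumptions the E-step reduces to a `tanh` weight and the M-step has a closed form.

```python
import numpy as np


def em_symmetric_mixture(x, theta0, sigma2_0, n_iters=200):
    """Run EM for the assumed fit 0.5*N(-theta, sigma2) + 0.5*N(theta, sigma2).

    x        : 1-D array of samples
    theta0   : initial location parameter
    sigma2_0 : initial (shared) variance, must be positive
    Returns the final (theta, sigma2).
    """
    x = np.asarray(x, dtype=float)
    second_moment = np.mean(x ** 2)
    theta, sigma2 = float(theta0), float(sigma2_0)
    for _ in range(n_iters):
        # E-step: if w_i is the posterior weight of the +theta component,
        # then 2*w_i - 1 = tanh(theta * x_i / sigma2).
        signed_weight = np.tanh(theta * x / sigma2)
        # M-step: closed-form maximizers for this symmetric model.
        theta = float(np.mean(signed_weight * x))
        sigma2 = float(second_moment - theta ** 2)
    return theta, sigma2


# Illustrative run: data from N(0, 1), so the "true" fit is theta = 0, sigma2 = 1.
rng = np.random.default_rng(0)
samples = rng.standard_normal(2000)
theta_hat, sigma2_hat = em_symmetric_mixture(samples, theta0=0.5, sigma2_0=1.0)
print(f"theta_hat = {theta_hat:.4f}, sigma2_hat = {sigma2_hat:.4f}")
```

Repeating the run for increasing sample sizes and plotting `abs(theta_hat)` and `abs(sigma2_hat - 1)` against $n$ on a log-log scale is one way to compare empirically against the $n^{-\frac{1}{8}}$ and $n^{-\frac{1}{4}}$ rates stated above. This is a sanity check under the stated assumptions, not a reproduction of the paper's experiments.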

