A Robust and Flexible EM Algorithm for Mixtures of Elliptical Distributions with Missing Data

Gaussian mixture (model) · Robustness · Outliers · Estimation/estimators · Data imputation

May 22, 2023


Florian Mouret, Alexandre Hippert-Ferrer, Frédéric Pascal, Jean-Yves Tourneret

This paper tackles the problem of missing data imputation for noisy and non-Gaussian data. A classical imputation method, the Expectation-Maximization (EM) algorithm for Gaussian mixture models, has shown interesting properties when compared to other popular approaches such as those based on k-nearest neighbors or on multiple imputation by chained equations. However, Gaussian mixture models are known to be non-robust to heterogeneous data, which can lead to poor estimation performance when the data are contaminated by outliers or follow non-Gaussian distributions. To overcome this issue, a new EM algorithm is investigated for mixtures of elliptical distributions that can handle missing data. This paper shows that this problem reduces to the estimation of a mixture of Angular Gaussian distributions under generic assumptions (i.e., each sample is drawn from a mixture of elliptical distributions, which may differ from one sample to another). In that case, the complete-data likelihood associated with mixtures of elliptical distributions is well adapted to the EM framework with missing data thanks to its conditional distribution, which is shown to be a multivariate $t$-distribution. Experimental results on synthetic data demonstrate that the proposed algorithm is robust to outliers and can be used with non-Gaussian data. Furthermore, experiments conducted on real-world datasets show that this algorithm is very competitive when compared to other classical imputation methods.
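
For readers unfamiliar with the classical baseline the abstract refers to, the following is a minimal sketch of EM-based imputation under a single multivariate Gaussian, i.e. a one-component Gaussian mixture. It is not the authors' elliptical-mixture algorithm, and it has none of its robustness to outliers. The function name `em_gaussian_impute` and the parameters `n_iter` and `reg` are illustrative assumptions, and the sketch assumes at least two features and at least one observed value per feature.

```python
import numpy as np


def em_gaussian_impute(X, n_iter=50, reg=1e-6):
    """Impute NaN entries of X (n samples, d features) with EM for one Gaussian.

    Illustrative baseline only: a single Gaussian component, not a mixture
    of elliptical distributions.
    """
    X = np.asarray(X, dtype=float)
    n, d = X.shape
    missing = np.isnan(X)

    # Initialise from the observed entries: column means, then the
    # covariance of the mean-filled data.
    mu = np.nanmean(X, axis=0)
    X_hat = np.where(missing, mu, X)
    sigma = np.cov(X_hat, rowvar=False) + reg * np.eye(d)

    for _ in range(n_iter):
        # Accumulates the conditional covariance of the missing blocks.
        C_sum = np.zeros((d, d))
        for i in range(n):
            m = missing[i]
            if not m.any():
                continue
            o = ~m
            if not o.any():
                # Nothing observed: fall back to the marginal distribution.
                X_hat[i] = mu
                C_sum += sigma
                continue
            # E-step: conditional mean and covariance of x_m given x_o.
            S_oo = sigma[np.ix_(o, o)]
            S_mo = sigma[np.ix_(m, o)]
            K = np.linalg.solve(S_oo, S_mo.T).T  # equals S_mo @ inv(S_oo)
            X_hat[i, m] = mu[m] + K @ (X[i, o] - mu[o])
            C_sum[np.ix_(m, m)] += sigma[np.ix_(m, m)] - K @ S_mo.T
        # M-step: update the mean and covariance from the completed data.
        mu = X_hat.mean(axis=0)
        diff = X_hat - mu
        sigma = (diff.T @ diff + C_sum) / n + reg * np.eye(d)

    return X_hat, mu, sigma
```

Calling `em_gaussian_impute(X)` on an array with `NaN` placeholders returns the completed array together with the estimated mean and covariance. Observed entries are left unchanged, and each missing entry is replaced by its conditional mean given the observed entries of the same row.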
