贝叶斯聚类：通过融合局部密度实现 (Bayesian Clustering via Fusing of Localized Densities) - 专知论文

会员服务 ·

0

局部密度 · 核函数 · 贝叶斯 · 融合 · MCMC ·

2023 年 3 月 31 日

Bayesian Clustering via Fusing of Localized Densities

翻译：贝叶斯聚类：通过融合局部密度实现

Alexander Dombowsky,David B. Dunson

Bayesian clustering typically relies on mixture models, with each component interpreted as a different cluster. After defining a prior for the component parameters and weights, Markov chain Monte Carlo (MCMC) algorithms are commonly used to produce samples from the posterior distribution of the component labels. The data are then clustered by minimizing the expectation of a clustering loss function that favours similarity to the component labels. Unfortunately, although these approaches are routinely implemented, clustering results are highly sensitive to kernel misspecification. For example, if Gaussian kernels are used but the true density of data within a cluster is even slightly non-Gaussian, then clusters will be broken into multiple Gaussian components. To address this problem, we develop Fusing of Localized Densities (FOLD), a novel clustering method that melds components together using the posterior of the kernels. FOLD has a fully Bayesian decision theoretic justification, naturally leads to uncertainty quantification, can be easily implemented as an add-on to MCMC algorithms for mixtures, and favours a small number of distinct clusters. We provide theoretical support for FOLD including clustering optimality under kernel misspecification. In simulated experiments and real data, FOLD outperforms competitors by minimizing the number of clusters while inferring meaningful group structure.

翻译：贝叶斯聚类通常依赖于混合模型，其中每个组件被解释为不同的簇。在为组件参数和权重定义先验后，通常使用马尔可夫链蒙特卡罗（MCMC）算法从组件标签的后验分布中产生样本。然后，通过最小化聚类损失函数的期望来将数据进行聚类，该函数有利于与组件标签的相似性。不幸的是，尽管这些方法经常被实现，但聚类结果对核函数规范化极为敏感。例如，如果使用高斯核但簇内数据的真实密度略微非高斯，则簇将被分成多个高斯组件。为了解决这个问题，我们开发了一种名为局部密度融合（FOLD）的新聚类方法，该方法使用核函数的后验将组件融合在一起。FOLD具有完全贝叶斯决策理论的依据，自然地导致不确定性量化，可以很容易地作为混合MCMC算法的附加组件实现，并有利于较少的不同簇数。我们提供了对FOLD的理论支持，包括在核函数规范化错误下的聚类最优性。在模拟实验和实际数据中，FOLD通过最小化聚类数并推断有意义的群体结构而优于竞争对手。

0

相关内容

局部密度

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

76+阅读 · 2022年6月28日

【WWW2022】图上的聚类感知的监督对比学习，ClusterSCL: Cluster-Aware Supervised Contrastive Learning on Graphs

【WWW2022】图上的聚类感知的监督对比学习，ClusterSCL: Cluster-Aware Supervised Contrastive Learning on Graphs

专知会员服务

18+阅读 · 2022年3月28日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

【干货书】计算成像，483页pdf，Computational Imaging Book, MIT 出版社

专知会员服务

67+阅读 · 2021年9月12日

【MIT】自监督几何感知，22页ppt，Self-supervised Geometric Perception

【MIT】自监督几何感知，22页ppt，Self-supervised Geometric Perception

专知会员服务

23+阅读 · 2021年6月3日

神经网络的拓扑结构，TOPOLOGY OF DEEP NEURAL NETWORKS

神经网络的拓扑结构，TOPOLOGY OF DEEP NEURAL NETWORKS

专知会员服务

35+阅读 · 2020年4月15日

【经典书】数据挖掘：理论、算法与示例，347页pdf，Nong Ye，Arizona State University

【经典书】数据挖掘：理论、算法与示例，347页pdf，Nong Ye，Arizona State University

专知会员服务

82+阅读 · 2020年2月27日

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

专知会员服务

93+阅读 · 2020年2月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

局部学习的特征选择：Local-Learning-Based Feature Selection

局部学习的特征选择：Local-Learning-Based Feature Selection

我爱读PAMI

14+阅读 · 2019年9月20日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

深度自进化聚类：Deep Self-Evolution Clustering

深度自进化聚类：Deep Self-Evolution Clustering

我爱读PAMI

15+阅读 · 2019年4月13日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

机器学习论文大全，涵盖深度学习、计算机视觉、分类、聚类、机器人学等

机器学习论文大全，涵盖深度学习、计算机视觉、分类、聚类、机器人学等

专知

17+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

Forward-Looking与Backward-Looking相结合的投资组合管理

国家自然科学基金

1+阅读 · 2014年12月31日

3维Lorentz空间中的伪圆纹Willmore曲面与4维球面中的共形曲面论

国家自然科学基金

0+阅读 · 2014年12月31日

神经网络随机学习算法的泛化性研究

国家自然科学基金

2+阅读 · 2013年12月31日

气溶胶雷达比与波长指数间相关性研究及实验观测

国家自然科学基金

0+阅读 · 2013年12月31日

过渡金属化合物纳米材料表面增强拉曼光谱的实验和理论研究

国家自然科学基金

0+阅读 · 2012年12月31日

信念的非修正处理方法及其自动推理研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于时间序列特征的金融资产相依结构模型构建及应用研究

国家自然科学基金

1+阅读 · 2012年12月31日

随机变分不等式

国家自然科学基金

0+阅读 · 2011年12月31日

有限域上多项式的降次与P-adic估计、指数和

国家自然科学基金

0+阅读 · 2009年12月31日

基于核、正则化与多目标优化技术的多标签分类算法及其应用研究

国家自然科学基金

1+阅读 · 2008年12月31日

Multi-Stream Extension of Variational Bayesian HMM Clustering (MS-VBx) for Combined End-to-End and Vector Clustering-based Diarization

Arxiv

0+阅读 · 2023年5月23日

funLOCI: a local clustering algorithm for functional data

Arxiv

0+阅读 · 2023年5月22日

Model Debiasing via Gradient-based Explanation on Representation

Arxiv

0+阅读 · 2023年5月20日

On the Relationship between Markov Switching Models and Fuzzy Clustering: a Nonparametric Method to Detect the Number of States

Arxiv

0+阅读 · 2023年5月20日

Multi-Objective Optimization Using the R2 Utility

Arxiv

0+阅读 · 2023年5月19日

Bayesian graph neural networks for strain-based crack localization

Arxiv

0+阅读 · 2023年5月19日

Multi-view Contrastive Graph Clustering

Arxiv

13+阅读 · 2021年10月22日

Graph Self-Supervised Learning: A Survey

Arxiv

15+阅读 · 2021年8月5日

Meta-Learning to Cluster

Meta-Learning to Cluster

Arxiv

18+阅读 · 2019年10月30日

Learning to Count Objects in Natural Images for Visual Question Answering

Arxiv

12+阅读 · 2018年2月15日

VIP会员

文章信息

相关主题

相关VIP内容

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

不可错过！《机器学习100讲》课程，UBC Mark Schmidt讲授

专知会员服务

76+阅读 · 2022年6月28日

【WWW2022】图上的聚类感知的监督对比学习，ClusterSCL: Cluster-Aware Supervised Contrastive Learning on Graphs

【WWW2022】图上的聚类感知的监督对比学习，ClusterSCL: Cluster-Aware Supervised Contrastive Learning on Graphs

专知会员服务

18+阅读 · 2022年3月28日

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

高效可扩展图神经网络的研究进展，Recent Advances in Efficient and Scalable Graph Neural Networks

专知会员服务

78+阅读 · 2022年3月15日

【干货书】计算成像，483页pdf，Computational Imaging Book, MIT 出版社

专知会员服务

67+阅读 · 2021年9月12日

【MIT】自监督几何感知，22页ppt，Self-supervised Geometric Perception

【MIT】自监督几何感知，22页ppt，Self-supervised Geometric Perception

专知会员服务

23+阅读 · 2021年6月3日

神经网络的拓扑结构，TOPOLOGY OF DEEP NEURAL NETWORKS

神经网络的拓扑结构，TOPOLOGY OF DEEP NEURAL NETWORKS

专知会员服务

35+阅读 · 2020年4月15日

【经典书】数据挖掘：理论、算法与示例，347页pdf，Nong Ye，Arizona State University

【经典书】数据挖掘：理论、算法与示例，347页pdf，Nong Ye，Arizona State University

专知会员服务

82+阅读 · 2020年2月27日

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

【新书】数字图像(影像)处理手第二版，2176pdf，Mathematical Methods in Imaging

专知会员服务

93+阅读 · 2020年2月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

热门VIP内容

开通专知VIP会员享更多权益服务

《代码、指挥与冲突：描绘军事人工智能的未来》报告

【斯坦福博士论文】面向地理空间数据的多模态与多尺度建模：时空生成式人工智能

美国启动“自有军事人工智能计划”：采用谷歌Gemini以推动全军人工智能应用

《创新与适应性作为军事成功的关键因素：来自俄乌战争的战略洞见》报告

相关资讯

局部学习的特征选择：Local-Learning-Based Feature Selection

局部学习的特征选择：Local-Learning-Based Feature Selection

我爱读PAMI

14+阅读 · 2019年9月20日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

深度自进化聚类：Deep Self-Evolution Clustering

深度自进化聚类：Deep Self-Evolution Clustering

我爱读PAMI

15+阅读 · 2019年4月13日

逆强化学习-学习人先验的动机

逆强化学习-学习人先验的动机

CreateAMind

16+阅读 · 2019年1月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

机器学习论文大全，涵盖深度学习、计算机视觉、分类、聚类、机器人学等

机器学习论文大全，涵盖深度学习、计算机视觉、分类、聚类、机器人学等

专知

17+阅读 · 2019年1月4日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

disentangled-representation-papers

disentangled-representation-papers

CreateAMind

26+阅读 · 2018年9月12日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

相关论文

Multi-Stream Extension of Variational Bayesian HMM Clustering (MS-VBx) for Combined End-to-End and Vector Clustering-based Diarization

Arxiv

0+阅读 · 2023年5月23日

funLOCI: a local clustering algorithm for functional data

Arxiv

0+阅读 · 2023年5月22日

Model Debiasing via Gradient-based Explanation on Representation

Arxiv

0+阅读 · 2023年5月20日

On the Relationship between Markov Switching Models and Fuzzy Clustering: a Nonparametric Method to Detect the Number of States

Arxiv

0+阅读 · 2023年5月20日

Multi-Objective Optimization Using the R2 Utility

Arxiv

0+阅读 · 2023年5月19日

Bayesian graph neural networks for strain-based crack localization

Arxiv

0+阅读 · 2023年5月19日

Multi-view Contrastive Graph Clustering

Arxiv

13+阅读 · 2021年10月22日

Graph Self-Supervised Learning: A Survey

Arxiv

15+阅读 · 2021年8月5日

Meta-Learning to Cluster

Meta-Learning to Cluster

Arxiv

18+阅读 · 2019年10月30日

Learning to Count Objects in Natural Images for Visual Question Answering

Arxiv

12+阅读 · 2018年2月15日

相关基金

Forward-Looking与Backward-Looking相结合的投资组合管理

国家自然科学基金

1+阅读 · 2014年12月31日

3维Lorentz空间中的伪圆纹Willmore曲面与4维球面中的共形曲面论

国家自然科学基金

0+阅读 · 2014年12月31日

神经网络随机学习算法的泛化性研究

国家自然科学基金

2+阅读 · 2013年12月31日

气溶胶雷达比与波长指数间相关性研究及实验观测

国家自然科学基金

0+阅读 · 2013年12月31日

过渡金属化合物纳米材料表面增强拉曼光谱的实验和理论研究

国家自然科学基金

0+阅读 · 2012年12月31日

信念的非修正处理方法及其自动推理研究

国家自然科学基金

0+阅读 · 2012年12月31日

基于时间序列特征的金融资产相依结构模型构建及应用研究

国家自然科学基金

1+阅读 · 2012年12月31日

随机变分不等式

国家自然科学基金

0+阅读 · 2011年12月31日

有限域上多项式的降次与P-adic估计、指数和

国家自然科学基金

0+阅读 · 2009年12月31日

基于核、正则化与多目标优化技术的多标签分类算法及其应用研究

国家自然科学基金

1+阅读 · 2008年12月31日

微信扫码咨询专知VIP会员