A new interpoint distance-based clustering algorithm using kernel density estimation


Clustering algorithms · Kernel density estimation · Density estimation · Nonparametric · Algorithms

April 28, 2023


Dr. Soumita Modak

From arXiv, 10 pages, 7 figures

A novel nonparametric clustering algorithm is proposed that uses the interpoint distances between the members of a data set to reveal its inherent clustering structure. We apply the classical nonparametric univariate kernel density estimation method to the interpoint distances to estimate the density around a data member. The algorithm is simple in its formulation and easy to apply, and it results in well-defined clusters. It starts with an objective selection of the initial cluster representative and always converges, independently of this choice. The method finds the number of clusters itself and can be used irrespective of the nature of the underlying data, given an appropriate interpoint distance measure. The cluster analysis can be carried out in a space of any dimension and remains viable for high-dimensional data. By design, the procedure does not require the distributions of the data or of their interpoint distances to be known; the only assumption is that the interpoint distances possess a density function. A data study shows its effectiveness and its superiority over widely used clustering algorithms.
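The abstract does not spell out the procedure, so the following is only a minimal sketch of the general idea of applying a univariate kernel density estimate to interpoint distances. It is not the author's algorithm. The function names, the Euclidean distance, the Gaussian kernel, Silverman's rule-of-thumb bandwidth, and the choice to evaluate the estimate at distance zero as a "density around a data member" score are all assumptions made for illustration.

```python
import numpy as np


def pairwise_distances(X):
    """Euclidean interpoint distance matrix (assumed distance measure)."""
    diff = X[:, None, :] - X[None, :, :]
    return np.sqrt((diff ** 2).sum(axis=-1))


def silverman_bandwidth(samples):
    """Silverman's rule-of-thumb bandwidth for a univariate sample."""
    n = len(samples)
    std = samples.std(ddof=1)
    q75, q25 = np.percentile(samples, [75, 25])
    iqr = q75 - q25
    # Use the more robust of the two spread estimates when the IQR is usable.
    scale = min(std, iqr / 1.34) if iqr > 0 else std
    if scale <= 0:
        raise ValueError("all distances are identical; bandwidth is undefined")
    return 0.9 * scale * n ** (-0.2)


def gaussian_kde_1d(samples, query, bandwidth):
    """Univariate Gaussian kernel density estimate evaluated at one point."""
    z = (query - samples) / bandwidth
    kernel_sum = np.exp(-0.5 * z ** 2).sum()
    return kernel_sum / (len(samples) * bandwidth * np.sqrt(2.0 * np.pi))


def density_scores(X):
    """Hypothetical per-point score: KDE of the distances from each point
    to all other points, evaluated at distance zero. A larger score means
    more neighbours lie close to that point."""
    D = pairwise_distances(X)
    scores = np.empty(len(X))
    for i in range(len(X)):
        d = np.delete(D[i], i)  # drop the zero self-distance
        scores[i] = gaussian_kde_1d(d, 0.0, silverman_bandwidth(d))
    return scores
```

Under these assumptions, a point in a crowded region receives a high score and an isolated point a score near zero. One hypothetical use is to take the highest-scoring point as a candidate cluster representative. Only the distance matrix enters the computation, so a different interpoint distance measure could be substituted without changing the rest of the sketch.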

