Breathing K-Means - Zhuanzhi Papers

October 10, 2021

Breathing K-Means

Comments (from arXiv): 55 pages, 45 figures. Relevant changes: the algorithm is now better *and* faster than the underlying KMeans class from scikit-learn; a detailed analysis of parameter m shows that it can be used to balance SSE and CPU time; parameter theta has been eliminated. Submitted to JMLR; Implementation: https://github.com/gittar/breathing-k-means; Python package: https://pypi.org/project/bkmeans

The k-means++ algorithm is the de-facto standard for finding approximate solutions to the k-means problem. A widely used implementation is provided by the scikit-learn Python package for machine learning. We propose the breathing k-means algorithm, which on average significantly outperforms scikit-learn's k-means++ w.r.t. both solution quality and execution speed. The initialization step in the new method is done by k-means++ but without the usual (and costly) repetitions (ten in scikit-learn). The core of the new method is a sequence of "breathing cycles," each consisting of a "breathe in" step where the number of centroids is increased by m and a "breathe out" step where m centroids are removed. Each step is ended by a run of Lloyd's algorithm. The parameter m is decreased until zero, at which point the algorithm terminates. With the default (m = 5), breathing k-means dominates scikit-learn's k-means++. This is demonstrated via experiments on various data sets, including all those from the original k-means++ publication. By setting m to smaller or larger values, one can optionally produce faster or better solutions, respectively. For larger values of m, e.g., m = 20, breathing k-means likely is the new SOTA for the k-means problem.
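The breathing cycle described in the abstract can be sketched roughly as follows. This is a simplified illustration using only NumPy, not the authors' implementation (see the repository linked above for that). Several details are assumptions made for the sketch: new centroids are placed next to the centroids of the clusters with the largest error, centroids are removed by lowest "utility" (the SSE increase their removal would cause), m is decremented by one whenever a cycle fails to improve the SSE by a relative tolerance `tol`, and the helper names `sq_dists`, `lloyd`, and `kmeanspp_init` are hypothetical.

```python
import numpy as np

def sq_dists(X, C):
    # Squared Euclidean distance from every point to every centroid.
    return ((X[:, None, :] - C[None, :, :]) ** 2).sum(axis=2)

def lloyd(X, C, max_iter=100):
    # Plain Lloyd iterations from centroids C; returns (centroids, SSE).
    C = C.copy()
    for _ in range(max_iter):
        labels = sq_dists(X, C).argmin(axis=1)
        new_C = C.copy()
        for j in range(len(C)):
            members = X[labels == j]
            if len(members):  # empty clusters keep their old centroid
                new_C[j] = members.mean(axis=0)
        if np.allclose(new_C, C):
            break
        C = new_C
    return C, sq_dists(X, C).min(axis=1).sum()

def kmeanspp_init(X, k, rng):
    # k-means++ seeding: sample points proportionally to squared distance.
    C = [X[rng.integers(len(X))]]
    for _ in range(k - 1):
        d = sq_dists(X, np.array(C)).min(axis=1)
        C.append(X[rng.choice(len(X), p=d / d.sum())])
    return np.array(C)

def breathing_kmeans(X, k, m=5, tol=1e-4, seed=0):
    rng = np.random.default_rng(seed)
    # Single k-means++ initialization, without repetitions.
    C, sse = lloyd(X, kmeanspp_init(X, k, rng))
    m = min(m, k)
    while m > 0:
        # Breathe in: add m centroids near the m clusters with largest error.
        D = sq_dists(X, C)
        err = np.bincount(D.argmin(axis=1), weights=D.min(axis=1),
                          minlength=len(C))
        worst = np.argsort(err)[-m:]
        offset = 1e-3 * X.std(axis=0) * rng.standard_normal((m, X.shape[1]))
        C_in, _ = lloyd(X, np.vstack([C, C[worst] + offset]))
        # Breathe out: remove the m centroids with the lowest utility
        # (assumption: utility = SSE increase if the centroid were deleted).
        D = sq_dists(X, C_in)
        two = np.partition(D, 1, axis=1)[:, :2]  # nearest, second nearest
        utility = np.bincount(D.argmin(axis=1), weights=two[:, 1] - two[:, 0],
                              minlength=len(C_in))
        C_out, sse_out = lloyd(X, C_in[np.argsort(utility)[m:]])
        if sse_out < sse * (1 - tol):
            C, sse = C_out, sse_out  # cycle helped: keep result, same m
        else:
            m -= 1                   # no improvement: shrink the breath
    return C, sse
```

A call such as `breathing_kmeans(X, k=8, m=5)` returns the `k` centroids and their SSE. Because a cycle's result is only accepted when it lowers the SSE, the sketch never ends with a worse solution than its single k-means++ plus Lloyd start.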
