Nearly-Tight and Oblivious Algorithms for Explainable Clustering - Zhuanzhi Papers


Cluster · Data point · Internal node · Decomposed · ICML 2020

October 24, 2021

Nearly-Tight and Oblivious Algorithms for Explainable Clustering


Buddhima Gamlath, Xinrui Jia, Adam Polak, Ola Svensson

We study the problem of explainable clustering in the setting first formalized by Dasgupta, Frost, Moshkovitz, and Rashtchian (ICML 2020). A $k$-clustering is said to be explainable if it is given by a decision tree where each internal node splits data points with a threshold cut in a single dimension (feature), and each of the $k$ leaves corresponds to a cluster. We give an algorithm that outputs an explainable clustering that loses at most a factor of $O(\log^2 k)$ compared to an optimal (not necessarily explainable) clustering for the $k$-medians objective, and a factor of $O(k \log^2 k)$ for the $k$-means objective. This improves over the previous best upper bounds of $O(k)$ and $O(k^2)$, respectively, and nearly matches the previous $\Omega(\log k)$ lower bound for $k$-medians and our new $\Omega(k)$ lower bound for $k$-means. The algorithm is remarkably simple. In particular, given an initial not necessarily explainable clustering in $\mathbb{R}^d$, it is oblivious to the data points and runs in time $O(dk \log^2 k)$, independent of the number of data points $n$. Our upper and lower bounds also generalize to objectives given by higher $\ell_p$-norms.
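To make the notion of an explainable clustering concrete, here is a minimal Python sketch of a threshold tree built from reference centers alone, without looking at the data points. The abstract does not spell out the algorithm, so the random-cut rule below (dimension chosen proportionally to the extent of the centers, threshold chosen uniformly within that extent) is an assumption made for illustration. The names `Node`, `build_tree`, `assign`, and `leaves` are hypothetical, and the sketch does not aim for the stated $O(dk \log^2 k)$ running time or approximation guarantees.

```python
import random
from dataclasses import dataclass
from typing import List, Optional, Sequence

Point = Sequence[float]


@dataclass
class Node:
    # A leaf stores a cluster id; an internal node stores one threshold cut.
    cluster: Optional[int] = None
    dim: int = -1
    threshold: float = 0.0
    left: Optional["Node"] = None
    right: Optional["Node"] = None


def build_tree(centers: List[Point], ids: List[int], rng: random.Random) -> Node:
    """Separate the centers listed in `ids` using random axis-aligned cuts.

    Assumption (illustrative only): all centers are pairwise distinct.
    Only the centers are inspected, never the data points.
    """
    if len(ids) == 1:
        return Node(cluster=ids[0])
    d = len(centers[0])
    lo = [min(centers[i][j] for i in ids) for j in range(d)]
    hi = [max(centers[i][j] for i in ids) for j in range(d)]
    widths = [hi[j] - lo[j] for j in range(d)]
    while True:
        # Pick a dimension with probability proportional to its extent,
        # then a threshold uniformly inside that extent.
        dim = rng.choices(range(d), weights=widths)[0]
        theta = rng.uniform(lo[dim], hi[dim])
        left_ids = [i for i in ids if centers[i][dim] <= theta]
        right_ids = [i for i in ids if centers[i][dim] > theta]
        if left_ids and right_ids:  # retry if the cut separates nothing
            break
    return Node(
        dim=dim,
        threshold=theta,
        left=build_tree(centers, left_ids, rng),
        right=build_tree(centers, right_ids, rng),
    )


def assign(tree: Node, x: Point) -> int:
    """Follow the threshold cuts from the root to a leaf; return its cluster id."""
    node = tree
    while node.cluster is None:
        node = node.left if x[node.dim] <= node.threshold else node.right
    return node.cluster


def leaves(tree: Node) -> int:
    """Count the leaves (clusters) of the tree."""
    if tree.cluster is not None:
        return 1
    return leaves(tree.left) + leaves(tree.right)


# Usage: three centers in R^2; the tree has exactly three leaves.
centers = [(0.0, 0.0), (10.0, 0.0), (0.0, 10.0)]
tree = build_tree(centers, list(range(len(centers))), random.Random(0))
labels = [assign(tree, c) for c in centers]
```

By construction each center ends up alone in its own leaf, so the tree has exactly $k$ leaves, and any data point is clustered by at most $k - 1$ single-feature comparisons along a root-to-leaf path. That path is what makes the clustering explainable.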


