A decision tree recursively splits a feature space $\mathbb{R}^{d}$ and then assigns class labels based on the resulting partition. Decision trees have been part of the basic machine-learning toolkit for decades. A large body of work studies heuristic algorithms for computing a decision tree from training data, usually aiming in particular to minimize the size of the resulting tree. In contrast, little is known about the complexity of the underlying computational problem: computing a minimum-size tree for the given training data. We study this problem with respect to the number $d$ of dimensions of the feature space. We show that it can be solved in $O(n^{2d + 1})$ time, but under reasonable complexity-theoretic assumptions it is not possible to achieve $f(d) \cdot n^{o(d / \log d)}$ running time, where $n$ is the number of training examples. The problem is solvable in $(dR)^{O(dR)} \cdot n^{1+o(1)}$ time if there are exactly two classes and $R$ is an upper bound on the number of tree leaves labeled with the minority class.
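To make the computational problem concrete, the sketch below is a minimal brute-force search (not the paper's $O(n^{2d+1})$ algorithm): it computes the minimum number of leaves of an axis-aligned decision tree consistent with the training data by trying every split. For a binary tree, minimizing the number of leaves is equivalent to minimizing the number of inner nodes, so either measure of tree size works. The assumption of axis-aligned threshold splits and the function name `min_leaves` are ours for illustration; the exhaustive recursion takes exponential time and only illustrates the search space that the polynomial-time (for fixed $d$) algorithm must navigate.

```python
def min_leaves(points, labels):
    """Minimum number of leaves of any axis-aligned decision tree that
    classifies `points` (tuples in R^d) consistently with `labels`.
    Exhaustive search; exponential time, for illustration only."""
    # Base case: all examples share one label, so a single leaf suffices.
    if len(set(labels)) <= 1:
        return 1
    d = len(points[0])
    best = float("inf")
    for axis in range(d):
        coords = sorted({p[axis] for p in points})
        # Only thresholds between consecutive distinct coordinates matter:
        # any other threshold induces the same partition of the points.
        for lo, hi in zip(coords, coords[1:]):
            t = (lo + hi) / 2
            left = [(p, y) for p, y in zip(points, labels) if p[axis] <= t]
            right = [(p, y) for p, y in zip(points, labels) if p[axis] > t]
            lp, ly = zip(*left)
            rp, ry = zip(*right)
            best = min(best, min_leaves(lp, ly) + min_leaves(rp, ry))
    # Stays infinite only if the data are inconsistent
    # (identical points with different labels).
    return best


# XOR-like example in d = 2: no axis-aligned tree with fewer than
# four leaves separates the two classes.
pts = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0), (1.0, 1.0)]
lab = ["A", "B", "B", "A"]
print(min_leaves(pts, lab))  # -> 4
```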