Regression trees are one of the oldest forms of AI models, and their predictions can be made without a calculator, which makes them broadly useful, particularly for high-stakes applications. Within the large literature on regression trees, there has been little effort towards full provable optimization, mainly due to the computational hardness of the problem. This work proposes a dynamic-programming-with-bounds approach to the construction of provably-optimal sparse regression trees. We leverage a novel lower bound based on an optimal solution to the k-Means clustering problem in one dimension over the set of labels. We are often able to find optimal sparse trees in seconds, even for challenging datasets that involve large numbers of samples and highly-correlated features.
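To make the lower bound concrete, the sketch below computes the optimal one-dimensional k-Means objective over a set of labels by dynamic programming; it is a minimal illustration under our own assumptions (the function name `kmeans_1d_cost` and the O(k·n²) DP are not taken from the paper). Since any regression tree with at most k leaves predicts a single constant per leaf, its sum of squared errors can be no smaller than this unconstrained clustering cost, which is what makes the quantity usable as a lower bound during the search.

```python
import numpy as np

def kmeans_1d_cost(labels, k):
    """Minimum within-cluster sum of squares for grouping 1-D labels into
    at most k clusters. Optimal 1-D clusters are contiguous in sorted order,
    so a simple O(k * n^2) dynamic program suffices for illustration."""
    y = np.sort(np.asarray(labels, dtype=float))
    n = len(y)
    # Prefix sums let us evaluate the SSE of any contiguous segment in O(1).
    ps = np.concatenate(([0.0], np.cumsum(y)))
    ps2 = np.concatenate(([0.0], np.cumsum(y * y)))

    def seg_sse(i, j):
        # Sum of squared errors of y[i:j] around its mean (j exclusive).
        s, s2, m = ps[j] - ps[i], ps2[j] - ps2[i], j - i
        return s2 - s * s / m

    # dp[c][j] = best cost of splitting the first j sorted labels into c clusters.
    INF = float("inf")
    dp = [[INF] * (n + 1) for _ in range(k + 1)]
    dp[0][0] = 0.0
    for c in range(1, k + 1):
        for j in range(1, n + 1):
            dp[c][j] = min(dp[c - 1][i] + seg_sse(i, j) for i in range(c - 1, j))
    # "At most k" clusters: take the best over cluster counts 1..k.
    return min(dp[c][n] for c in range(1, k + 1))
```

For example, `kmeans_1d_cost(y, 3)` on the label vector of a candidate subproblem gives a value that no 3-leaf subtree over those samples can beat, so a branch whose bound already exceeds the best known objective can be pruned without enumeration.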