In the statistical literature, sparse modeling is the standard approach for improving both prediction accuracy and interpretability. Alternatively, in the seminal paper "Statistical Modeling: The Two Cultures," Breiman (2001) advocated the adoption of algorithmic approaches that generate ensembles, achieving prediction accuracy superior to single-model methods at the cost of interpretability. In a recent important and critical paper, Rudin (2019) argued that black-box algorithmic approaches should be avoided for high-stakes decisions and that the tradeoff between accuracy and interpretability is a myth. In response to this recent shift in philosophy, we generalize best subset selection (BSS) to best split selection (BSpS), a data-driven approach aimed at finding the optimal split of the predictor variables among the models of an ensemble. The proposed methodology results in an ensemble of sparse and diverse models, each offering a possible mechanism explaining the relationship between the predictors and the response. The high computational cost of BSpS motivates the need for computationally tractable approximations to the exhaustive search, and we benchmark one such recent proposal by Christidis et al. (2020) based on a multi-convex relaxation. Our objective with this article is to motivate research in this exciting new field, which holds great potential for data analysis tasks involving high-dimensional data.
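To illustrate the combinatorial cost that motivates relaxations of BSpS, the following is a minimal brute-force sketch, not the authors' implementation: each predictor is assigned to one of a fixed number of models (or left unused), each model is fit by ordinary least squares, and ensemble predictions are averaged. The function name and assignment encoding are hypothetical; the point is that the search space grows as (number of models + 1)^p, which is infeasible beyond very small p.

```python
import itertools

import numpy as np


def best_split_selection(X, y, n_models=2):
    """Exhaustive best split selection (BSpS) for a tiny problem.

    Each predictor is labeled 0 (unused) or 1..n_models (its model).
    Every non-empty model is fit by least squares on its own
    predictors, and the ensemble prediction is the average across
    models. Returns the split minimizing in-sample MSE. The loop
    visits (n_models + 1) ** p candidate splits, which is why
    exhaustive search is intractable except for very small p.
    """
    n, p = X.shape
    best_mse, best_split = np.inf, None
    for split in itertools.product(range(n_models + 1), repeat=p):
        preds = []
        for g in range(1, n_models + 1):
            cols = [j for j in range(p) if split[j] == g]
            if not cols:
                continue  # this model received no predictors
            Xg = X[:, cols]
            beta, *_ = np.linalg.lstsq(Xg, y, rcond=None)
            preds.append(Xg @ beta)
        if not preds:
            continue  # all predictors unused
        y_hat = np.mean(preds, axis=0)
        mse = np.mean((y - y_hat) ** 2)
        if mse < best_mse:
            best_mse, best_split = mse, split
    return best_split, best_mse
```

For p = 20 predictors and two models this already requires 3^20 (about 3.5 billion) least-squares fits, which is the scale of difficulty that the multi-convex relaxation of Christidis et al. (2020) is designed to avoid.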