Random forests remain among the most popular off-the-shelf supervised learning algorithms. Despite their well-documented empirical success, however, until recently few theoretical results were available to describe their performance and behavior. In this work we move beyond recent results on consistency and asymptotic normality by establishing rates of convergence for random forests and other supervised learning ensembles. We develop the notion of generalized U-statistics and show that, within this framework, random forest predictions can remain asymptotically normal for larger subsample sizes than previously established. We also provide Berry-Esseen bounds that quantify the rate at which this convergence occurs, making explicit the roles of the subsample size and the number of trees in determining the distribution of random forest predictions.
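To make the U-statistic structure concrete, the following is a minimal sketch of a subsample-and-average ensemble: each of B base learners is fit on a random subsample of size k, and the prediction is their average. The nearest-neighbor base learner here is a hypothetical stand-in for a tree, chosen only to keep the example self-contained; the subsample size `k` and ensemble size `B` play the roles of the quantities whose interplay the abstract's Berry-Esseen bounds quantify.

```python
import random

def base_learner(subsample, x0):
    # Hypothetical stand-in for a regression tree: predict the response of
    # the nearest neighbor (in x) within the subsample.
    xs, ys = zip(*subsample)
    i = min(range(len(xs)), key=lambda j: abs(xs[j] - x0))
    return ys[i]

def ensemble_predict(data, x0, k, B, seed=0):
    """Average B base learners, each fit on a random subsample of size k.

    Averaging a fixed base learner over random subsamples of the data is
    what gives the prediction a (generalized) U-statistic form; results of
    the kind described above say such predictions are approximately normal,
    with the quality of the approximation governed by k and B.
    """
    rng = random.Random(seed)
    total = 0.0
    for _ in range(B):
        subsample = rng.sample(data, k)
        total += base_learner(subsample, x0)
    return total / B
```

For example, on noiseless data `[(x, 2*x) for x in range(100)]`, averaging many subsample predictions at a query point concentrates around the true regression value, illustrating the aggregation that the distributional results describe.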