通过可追踪的前例和概率、无因地自由语法的巴伊西亚决定树 (Bayesian Decision Trees via Tractable Priors and Probabilistic Context-Free Grammars) - 专知论文

会员服务 ·

0

样本 · 易处理的 · Performer · 准则 · Learning ·

2023 年 2 月 15 日

Bayesian Decision Trees via Tractable Priors and Probabilistic Context-Free Grammars

翻译：通过可追踪的前例和概率、无因地自由语法的巴伊西亚决定树

Colin Sullivan,Mo Tiwari,Sebastian Thrun,Chris Piech

from arxiv, 10 pages, 1 figure

Decision Trees are some of the most popular machine learning models today due to their out-of-the-box performance and interpretability. Often, Decision Trees models are constructed greedily in a top-down fashion via heuristic search criteria, such as Gini impurity or entropy. However, trees constructed in this manner are sensitive to minor fluctuations in training data and are prone to overfitting. In contrast, Bayesian approaches to tree construction formulate the selection process as a posterior inference problem; such approaches are more stable and provide greater theoretical guarantees. However, generating Bayesian Decision Trees usually requires sampling from complex, multimodal posterior distributions. Current Markov Chain Monte Carlo-based approaches for sampling Bayesian Decision Trees are prone to mode collapse and long mixing times, which makes them impractical. In this paper, we propose a new criterion for training Bayesian Decision Trees. Our criterion gives rise to BCART-PCFG, which can efficiently sample decision trees from a posterior distribution across trees given the data and find the maximum a posteriori (MAP) tree. Learning the posterior and training the sampler can be done in time that is polynomial in the dataset size. Once the posterior has been learned, trees can be sampled efficiently (linearly in the number of nodes). At the core of our method is a reduction of sampling the posterior to sampling a derivation from a probabilistic context-free grammar. We find that trees sampled via BCART-PCFG perform comparable to or better than greedily-constructed Decision Trees in classification accuracy on several datasets. Additionally, the trees sampled via BCART-PCFG are significantly smaller -- sometimes by as much as 20x.

翻译：决策树是当今最受欢迎的机器学习模型之一, 因为它们在框外的性能和可解释性。通常, 决策树模型是通过超常搜索标准, 如 Gini 杂质或 entropy, 以自上而下的方式以自上而下的方式构建的。然而, 以这种方式构建的树对培训数据的轻微波动十分敏感, 并且容易过度配置。相反, 巴伊西亚树建设方法将选择过程描述成一个后继推论问题; 此类方法更稳定, 并提供更大的理论保障。但是, 创建巴伊西亚决策树通常需要从复杂、多式联运的海报分布中取样。目前 Markov 链 Monte Car 用于取样 Bayesian 决定树的策略容易发生模式崩溃和长时间混杂。然而, 我们的标定标准是 BCARRT- PC, 可以有效地从树的表面分布到树上的红外线( MAP) 树中采集最高级的。在可比较的取样方法中, 我们的骨质样本中, 将数据从一个直径流数据从一个直到一个开始, 。

0

相关内容

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

【干货书】机器学习速查手册，135页pdf

【干货书】机器学习速查手册，135页pdf

专知会员服务

127+阅读 · 2020年11月20日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【代码资源】GAN | 七份最热GAN文章及代码分享（Github 1000+Stars）

【代码资源】GAN | 七份最热GAN文章及代码分享（Github 1000+Stars）

专知

13+阅读 · 2018年6月24日

ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

全球人工智能

20+阅读 · 2017年12月17日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

资源｜斯坦福课程：深度学习理论！

资源｜斯坦福课程：深度学习理论！

全球人工智能

17+阅读 · 2017年11月9日

Decorin对急性缺血性卒中后血脑屏障中ZO-1蛋白的作用及机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

自适应软件系统的无缝演化与环境感知技术研究

国家自然科学基金

0+阅读 · 2014年12月31日

Progranulin在糖尿病肾病足细胞损伤中的保护作用及分子机制

国家自然科学基金

0+阅读 · 2014年12月31日

质子交换膜微结构中质子输运复杂行为的模拟研究

国家自然科学基金

0+阅读 · 2013年12月31日

地基InSAR高边坡三维变形提取方法研究

国家自然科学基金

0+阅读 · 2013年12月31日

商丹构造带中高应变剪切带的野外和数值模拟研究

国家自然科学基金

0+阅读 · 2012年12月31日

内质网应激介导的细胞内铁代谢紊乱在SAH后早期脑损伤的作用

国家自然科学基金

0+阅读 · 2012年12月31日

近紫外基白光LED用(CaCl2/SiO2):Eu,Mn橙红荧光粉的发光特性

国家自然科学基金

0+阅读 · 2009年12月31日

编织C/SiC复合材料的各向异性损伤演化及本构关系研究

国家自然科学基金

0+阅读 · 2009年12月31日

抗MPB64-McAb INH、RFP聚乳酸纳米粒靶向治疗脊柱结核的实验研究

国家自然科学基金

0+阅读 · 2009年12月31日

Adjustable Privacy using Autoencoder-based Learning Structure

Arxiv

0+阅读 · 2023年4月7日

Robust Forecasting for Robotic Control: A Game-Theoretic Approach

Arxiv

0+阅读 · 2023年4月5日

Generalisation under gradient descent via deterministic PAC-Bayes

Arxiv

0+阅读 · 2023年4月4日

Self-restricting Noise and Exponential Decay in Quantum Dynamics

Arxiv

0+阅读 · 2023年4月4日

SAT Requires Exhaustive Search

Arxiv

0+阅读 · 2023年4月4日

A portfolio approach to massively parallel Bayesian optimization

Arxiv

0+阅读 · 2023年4月3日

Bayesian Controller Fusion: Leveraging Control Priors in Deep Reinforcement Learning for Robotics

Arxiv

0+阅读 · 2023年4月3日

Robotic Perception of Transparent Objects: A Review

Arxiv

0+阅读 · 2023年3月31日

Bayesian Clustering via Fusing of Localized Densities

Arxiv

0+阅读 · 2023年3月31日

Engagement Decision Support for Beyond Visual Range Air Combat

Engagement Decision Support for Beyond Visual Range Air Combat

Arxiv

63+阅读 · 2021年11月4日

VIP会员

文章信息

相关主题

相关VIP内容

INRIA 最新《机器学习理论》课程笔记，176页pdf

专知会员服务

51+阅读 · 2020年12月14日

【干货书】机器学习速查手册，135页pdf

【干货书】机器学习速查手册，135页pdf

专知会员服务

127+阅读 · 2020年11月20日

Linux导论，Introduction to Linux，96页ppt

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

100+篇《自监督学习(Self-Supervised Learning)》论文最新合集

专知会员服务

166+阅读 · 2020年3月18日

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

Keras François Chollet 《Deep Learning with Python 》, 386页pdf

专知会员服务

160+阅读 · 2019年10月12日

强化学习最新教程，17页pdf

强化学习最新教程，17页pdf

专知会员服务

182+阅读 · 2019年10月11日

机器学习入门的经验与建议

机器学习入门的经验与建议

专知会员服务

94+阅读 · 2019年10月10日

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

【CMU卡内基梅隆大学】深度学习在计算机视觉的应用：方法，解释，因果与公平性

专知会员服务

83+阅读 · 2019年10月9日

【哈佛大学商学院课程Fall 2019】机器学习可解释性

【哈佛大学商学院课程Fall 2019】机器学习可解释性

专知会员服务

105+阅读 · 2019年10月9日

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

【SIGGRAPH2019】TensorFlow 2.0深度学习计算机图形学应用

专知会员服务

41+阅读 · 2019年10月9日

热门VIP内容

开通专知VIP会员享更多权益服务

人机协同作战规划：来自美海军陆战队的大语言模型（LLM）使用教训

对北约军事总部战略规划制定与实施的研究 | 140页

美联参会指南-联合规划与执行概述及政策框架 | 32页

俄罗斯军事规划差异性凸显其思维的重要性 | 2025最新文献

相关资讯

VCIP 2022 Call for Demos

VCIP 2022 Call for Demos

CCF多媒体专委会

1+阅读 · 2022年6月6日

Hierarchically Structured Meta-learning

Hierarchically Structured Meta-learning

CreateAMind

27+阅读 · 2019年5月22日

Transferring Knowledge across Learning Processes

Transferring Knowledge across Learning Processes

CreateAMind

29+阅读 · 2019年5月18日

强化学习的Unsupervised Meta-Learning

强化学习的Unsupervised Meta-Learning

CreateAMind

18+阅读 · 2019年1月7日

Unsupervised Learning via Meta-Learning

Unsupervised Learning via Meta-Learning

CreateAMind

43+阅读 · 2019年1月3日

A Technical Overview of AI & ML in 2018 & Trends for 2019

A Technical Overview of AI & ML in 2018 & Trends for 2019

待字闺中

18+阅读 · 2018年12月24日

【代码资源】GAN | 七份最热GAN文章及代码分享（Github 1000+Stars）

【代码资源】GAN | 七份最热GAN文章及代码分享（Github 1000+Stars）

专知

13+阅读 · 2018年6月24日

ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

ResNet, AlexNet, VGG, Inception：各种卷积网络架构的理解

全球人工智能

20+阅读 · 2017年12月17日

【论文】变分推断（Variational inference)的总结

【论文】变分推断（Variational inference)的总结

机器学习研究会

39+阅读 · 2017年11月16日

资源｜斯坦福课程：深度学习理论！

资源｜斯坦福课程：深度学习理论！

全球人工智能

17+阅读 · 2017年11月9日

相关论文

Adjustable Privacy using Autoencoder-based Learning Structure

Arxiv

0+阅读 · 2023年4月7日

Robust Forecasting for Robotic Control: A Game-Theoretic Approach

Arxiv

0+阅读 · 2023年4月5日

Generalisation under gradient descent via deterministic PAC-Bayes

Arxiv

0+阅读 · 2023年4月4日

Self-restricting Noise and Exponential Decay in Quantum Dynamics

Arxiv

0+阅读 · 2023年4月4日

SAT Requires Exhaustive Search

Arxiv

0+阅读 · 2023年4月4日

A portfolio approach to massively parallel Bayesian optimization

Arxiv

0+阅读 · 2023年4月3日

Bayesian Controller Fusion: Leveraging Control Priors in Deep Reinforcement Learning for Robotics

Arxiv

0+阅读 · 2023年4月3日

Robotic Perception of Transparent Objects: A Review

Arxiv

0+阅读 · 2023年3月31日

Bayesian Clustering via Fusing of Localized Densities

Arxiv

0+阅读 · 2023年3月31日

Engagement Decision Support for Beyond Visual Range Air Combat

Engagement Decision Support for Beyond Visual Range Air Combat

Arxiv

63+阅读 · 2021年11月4日

相关基金

Decorin对急性缺血性卒中后血脑屏障中ZO-1蛋白的作用及机制研究

国家自然科学基金

0+阅读 · 2015年12月31日

自适应软件系统的无缝演化与环境感知技术研究

国家自然科学基金

0+阅读 · 2014年12月31日

Progranulin在糖尿病肾病足细胞损伤中的保护作用及分子机制

国家自然科学基金

0+阅读 · 2014年12月31日

质子交换膜微结构中质子输运复杂行为的模拟研究

国家自然科学基金

0+阅读 · 2013年12月31日

地基InSAR高边坡三维变形提取方法研究

国家自然科学基金

0+阅读 · 2013年12月31日

商丹构造带中高应变剪切带的野外和数值模拟研究

国家自然科学基金

0+阅读 · 2012年12月31日

内质网应激介导的细胞内铁代谢紊乱在SAH后早期脑损伤的作用

国家自然科学基金

0+阅读 · 2012年12月31日

近紫外基白光LED用(CaCl2/SiO2):Eu,Mn橙红荧光粉的发光特性

国家自然科学基金

0+阅读 · 2009年12月31日

编织C/SiC复合材料的各向异性损伤演化及本构关系研究

国家自然科学基金

0+阅读 · 2009年12月31日

抗MPB64-McAb INH、RFP聚乳酸纳米粒靶向治疗脊柱结核的实验研究

国家自然科学基金

0+阅读 · 2009年12月31日

微信扫码咨询专知VIP会员