SHAP (SHapley Additive exPlanation) values provide a game-theoretic interpretation of the predictions of machine learning models based on Shapley values. While exact calculation of SHAP values is computationally intractable in general, a recursive polynomial-time algorithm called TreeShap is available for decision tree models. However, despite its polynomial time complexity, TreeShap can become a significant bottleneck in practical machine learning pipelines when applied to large decision tree ensembles. We present GPUTreeShap, a modified TreeShap algorithm suitable for massively parallel computation on graphics processing units. Our approach first preprocesses each decision tree to isolate variable-sized sub-problems from the original recursive algorithm, then solves a bin packing problem, and finally maps sub-problems to single-instruction, multiple-thread (SIMT) tasks for parallel execution with specialised hardware instructions. With a single NVIDIA Tesla V100-32 GPU, we achieve speedups of up to 19x for SHAP values, and speedups of up to 340x for SHAP interaction values, over a state-of-the-art multi-core CPU implementation executed on two 20-core Xeon E5-2698 v4 2.2 GHz CPUs. We also experiment with multi-GPU computing using eight V100 GPUs, demonstrating throughput of 1.2M rows per second; equivalent CPU-based performance is estimated to require 6850 CPU cores.
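To illustrate the scheduling step described above, the following is a minimal sketch of a first-fit-decreasing bin packing heuristic in Python. The paper's actual implementation is CUDA C++ and considers more than one packing heuristic; the function name, the warp capacity of 32 threads, and the example sub-problem sizes below are ours for illustration only.

```python
# Illustrative sketch only: greedy first-fit-decreasing bin packing.
# In GPUTreeShap, variable-sized sub-problems derived from tree paths
# must be packed onto fixed-capacity 32-thread GPU warps; this shows the
# flavour of such a heuristic, not the paper's CUDA implementation.
def first_fit_decreasing(item_sizes, capacity=32):
    """Place each item in the first bin with room; open a new bin if none fits."""
    bins = []  # each bin is a list of item sizes summing to <= capacity
    for size in sorted(item_sizes, reverse=True):
        for b in bins:
            if sum(b) + size <= capacity:
                b.append(size)
                break
        else:  # no existing bin could hold the item
            bins.append([size])
    return bins

# Hypothetical sub-problem sizes (e.g. path lengths rounded up to powers of two).
print(first_fit_decreasing([16, 8, 8, 4, 2, 2, 32, 1]))
# -> [[32], [16, 8, 8], [4, 2, 2, 1]]
```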
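As a hedged usage sketch, GPUTreeShap has been upstreamed into XGBoost's GPU predictor, so the SHAP values and SHAP interaction values benchmarked above can be requested from Python. The synthetic data and hyperparameters here are placeholders, and the snippet assumes an XGBoost 2.x build with CUDA support and an available NVIDIA GPU (older releases used tree_method="gpu_hist" with predictor="gpu_predictor" in place of the device parameter).

```python
import numpy as np
import xgboost as xgb

# Placeholder synthetic data; any tabular dataset works the same way.
rng = np.random.default_rng(0)
X = rng.standard_normal((10_000, 20)).astype(np.float32)
y = (X[:, 0] + 0.5 * rng.standard_normal(10_000) > 0).astype(np.float32)

dtrain = xgb.DMatrix(X, label=y)
booster = xgb.train(
    {"objective": "binary:logistic", "device": "cuda"},  # train and predict on the GPU
    dtrain,
    num_boost_round=100,
)

# Per-feature SHAP values via GPUTreeShap: shape (n_rows, n_features + 1),
# where the final column is the bias (expected value) term.
shap_values = booster.predict(dtrain, pred_contribs=True)

# Pairwise SHAP interaction values: shape (n_rows, n_features + 1, n_features + 1).
shap_interactions = booster.predict(dtrain, pred_interactions=True)
```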