Explainability in yield prediction helps us fully explore the potential of machine learning models that already achieve high accuracy for a variety of yield prediction scenarios. The data used to predict yields are intricate, and the resulting models are often difficult to understand. However, understanding such models can be simplified by using natural groupings of the input features, for example grouping by the time at which a feature is captured or by the sensor used to capture it. The state of the art for interpreting machine learning models is currently defined by the game-theoretic approach of Shapley values. To handle groups of features, the per-feature Shapley values are typically summed, ignoring the theoretical limitations of this approach. We explain the concept of Shapley values computed directly for predefined groups of features and introduce an algorithm to compute them efficiently on tree structures. We provide a blueprint for designing swarm plots that combine many local explanations into a global understanding. An extensive evaluation on two different yield prediction problems shows the value of our approach and demonstrates how it can enable a better understanding of yield prediction models in the future, ultimately leading to a mutual enrichment of research and application.
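For reference, the group-level quantity the abstract refers to can be sketched with the classical Shapley value and its direct extension to feature groups; this is a standard formalization in which each predefined group acts as a single player, and is not necessarily the paper's exact notation. With feature set $N$ and a value function $v(S)$ giving the model's expected output when only the features in $S \subseteq N$ are known, the Shapley value of feature $i$ is

\[
\phi_i(v) = \sum_{S \subseteq N \setminus \{i\}} \frac{|S|!\,(|N|-|S|-1)!}{|N|!} \bigl( v(S \cup \{i\}) - v(S) \bigr).
\]

For a predefined partition $P = \{G_1, \dots, G_k\}$ of $N$, the group Shapley value treats each group as one player, so coalitions are unions of whole groups:

\[
\phi_{G_j}(v) = \sum_{Q \subseteq P \setminus \{G_j\}} \frac{|Q|!\,(|P|-|Q|-1)!}{|P|!} \Bigl( v\Bigl(\bigcup_{G \in Q} G \cup G_j\Bigr) - v\Bigl(\bigcup_{G \in Q} G\Bigr) \Bigr).
\]

In general $\phi_{G_j}(v) \neq \sum_{i \in G_j} \phi_i(v)$; the two coincide only in special cases, which is the theoretical limitation of simply summing per-feature Shapley values alluded to above.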
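As a point of contrast, the baseline practice the abstract criticizes can be illustrated with a minimal sketch, assuming a scikit-learn tree ensemble and the shap package; the dataset, feature names, and groups below are synthetic placeholders, and this is not the paper's group-Shapley algorithm.

```python
# Minimal sketch of the common baseline: per-feature SHAP values summed
# within predefined feature groups. This is the practice whose theoretical
# limitations the abstract points out, not the paper's own algorithm.
import numpy as np
import shap
from sklearn.ensemble import RandomForestRegressor

rng = np.random.default_rng(0)

# Synthetic stand-in for a yield dataset: two "sensors", three features each.
feature_names = ["s1_t1", "s1_t2", "s1_t3", "s2_t1", "s2_t2", "s2_t3"]
groups = {"sensor_1": [0, 1, 2], "sensor_2": [3, 4, 5]}

X = rng.normal(size=(500, len(feature_names)))
y = X[:, :3].sum(axis=1) * X[:, 3] + rng.normal(scale=0.1, size=500)

model = RandomForestRegressor(n_estimators=100, random_state=0).fit(X, y)

# Per-feature Shapley values from TreeSHAP: shape (n_samples, n_features).
explainer = shap.TreeExplainer(model)
shap_values = explainer.shap_values(X)

# Naive group attribution: add the per-feature values within each group.
group_shap = np.column_stack(
    [shap_values[:, idx].sum(axis=1) for idx in groups.values()]
)
print(dict(zip(groups, np.abs(group_shap).mean(axis=0))))
```

Rendering `group_shap` with a beeswarm-style summary, e.g. `shap.summary_plot(group_shap, feature_names=list(groups))`, yields the kind of swarm plot of many local, group-level explanations that the abstract describes.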