Grouped Feature Importance and Combined Features Effect Plot

Interpretable machine learning has become a very active area of research due to the rising popularity of machine learning algorithms and their inherently challenging interpretability. Most work in this area has been focused on the interpretation of single features in a model. However, for researchers and practitioners, it is often equally important to quantify the importance or visualize the effect of feature groups. To address this research gap, we provide a comprehensive overview of how existing model-agnostic techniques can be defined for feature groups to assess the grouped feature importance, focusing on permutation-based, refitting, and Shapley-based methods. We also introduce an importance-based sequential procedure that identifies a stable and well-performing combination of features in the grouped feature space. Furthermore, we introduce the combined features effect plot, which is a technique to visualize the effect of a group of features based on a sparse, interpretable linear combination of features. We used simulation studies and a real data example from computational psychology to analyze, compare, and discuss these methods.
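To make the permutation-based variant mentioned above concrete, here is a minimal sketch of one common way to define it for groups: all features in a group are shuffled jointly, with the same row permutation, and the importance is the mean increase in loss. This is an illustration under stated assumptions, not the authors' implementation. The function names (`mse`, `grouped_permutation_importance`), the toy model, and the group labels are all hypothetical.

```python
import random


def mse(y_true, y_pred):
    """Mean squared error between two equal-length sequences."""
    return sum((a - b) ** 2 for a, b in zip(y_true, y_pred)) / len(y_true)


def grouped_permutation_importance(predict, X, y, groups, n_repeats=20, seed=0):
    """Mean increase in MSE when each feature group is permuted jointly.

    predict : callable mapping one feature row (list) to a prediction
    groups  : dict mapping a group name to a list of column indices
    (Hypothetical helper for illustration only.)
    """
    rng = random.Random(seed)
    baseline = mse(y, [predict(row) for row in X])
    importance = {}
    for name, cols in groups.items():
        increases = []
        for _ in range(n_repeats):
            # One row permutation shared by every column in the group,
            # so the dependence structure inside the group is preserved.
            order = list(range(len(X)))
            rng.shuffle(order)
            X_perm = [list(row) for row in X]
            for i, src in enumerate(order):
                for c in cols:
                    X_perm[i][c] = X[src][c]
            permuted_loss = mse(y, [predict(row) for row in X_perm])
            increases.append(permuted_loss - baseline)
        importance[name] = sum(increases) / n_repeats
    return importance


# Toy setup (assumed): four Gaussian features and a fixed linear "model"
# that relies heavily on columns 0 and 1, barely on column 2, not on 3.
data_rng = random.Random(42)
X = [[data_rng.gauss(0, 1) for _ in range(4)] for _ in range(500)]


def model(row):
    return 3.0 * row[0] + 2.0 * row[1] + 0.1 * row[2]


y = [model(row) for row in X]
groups = {"strong": [0, 1], "weak": [2, 3]}
scores = grouped_permutation_importance(model, X, y, groups)
print(scores)
```

In this toy setup the "strong" group should receive a much larger score than the "weak" group, because the model depends almost entirely on columns 0 and 1. Shuffling a group with a shared permutation breaks its association with the target and with the other groups while keeping within-group correlations intact, which is what distinguishes a grouped score from the sum of single-feature scores.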

