使用形状值和变化式自动解析器解释具有依附性混合特征的预测模型 (Using Shapley Values and Variational Autoencoders to Explain Predictive Models with Dependent Mixed Features)

Shapley values are today extensively used as a model-agnostic explanation framework to explain complex predictive machine learning models. Shapley values have desirable theoretical properties and a sound mathematical foundation. Precise Shapley value estimates for dependent data rely on accurate modeling of the dependencies between all feature combinations. In this paper, we use a variational autoencoder with arbitrary conditioning (VAEAC) to model all feature dependencies simultaneously. We demonstrate through comprehensive simulation studies that VAEAC outperforms the state-of-the-art methods for a wide range of settings for both continuous and mixed dependent features. Finally, we apply VAEAC to the Abalone data set from the UCI Machine Learning Repository.

翻译：光谱值如今被广泛用作解释复杂预测机器学习模型的模型-不可知解释框架。光谱值具有理想的理论属性和健全的数学基础。光谱值对依赖数据的精确估计依赖于所有特征组合之间依赖性的准确模型。在本文中,我们使用一个具有任意调节功能的变式自动编码器(VAEAC)同时模拟所有特征依赖性。我们通过综合模拟研究来证明,光谱仪在连续和混合依赖性特征的多种环境中都优于最先进的方法。最后,我们将光谱光谱值应用到从 UCI 机器学习存储库收集的Abone 数据中。

相关内容

自编码器

关注 140

自动编码器是一种人工神经网络，用于以无监督的方式学习有效的数据编码。自动编码器的目的是通过训练网络忽略信号“噪声”来学习一组数据的表示（编码），通常用于降维。与简化方面一起，学习了重构方面，在此，自动编码器尝试从简化编码中生成尽可能接近其原始输入的表示形式，从而得到其名称。基本模型存在几种变体，其目的是迫使学习的输入表示形式具有有用的属性。自动编码器可有效地解决许多应用问题，从面部识别到获取单词的语义。

【经典书】线性代数，436页pdf

专知会员服务

78+阅读 · 2021年3月16日

【KDD2020】基于节点-边缘协同解纠缠的可解释深图生成，Interpretable Deep Graph Generation with Node-edge Co-disentanglement

专知会员服务

32+阅读 · 2020年6月11日

【AAAI 2020】InteractE: 通过增加特征交互来改进基于卷积的知识图谱嵌入， InteractE: Improving Convolution-based Knowledge Graph Embeddings by Increasing Feature Interactions

专知会员服务

53+阅读 · 2020年6月7日

【跨语言BERT模型大集合】Transfer learning is increasingly going multilingual with language-specific BERT models

专知会员服务

54+阅读 · 2020年1月30日