Variable importance is defined as a measure of each regressor's contribution to model fit. Using R^2 as the fit criterion in linear models leads to the Shapley value (LMG) and proportionate value (PMVD) as variable importance measures. Similar measures are defined for ensemble models, using random forests as the example. The properties of the LMG and PMVD are compared. Variable importance is proposed to assess regressors' practical effects or "oomph." The uses of variable importance in modelling, interventions and causal analysis are discussed.
翻译:变量重要性被定义为衡量每个递减者对模型适用性的贡献的尺度。在线性模型中,使用R%2作为合适的标准,可导致以损耗值和成比例值作为可变重要性衡量尺度。对组合模型也规定了类似的措施,以随机森林为例。比较了LMG和PMVD的特性。提出了变量重要性,以评估递减者的实际效果或“缩影”。讨论了模型、干预和因果分析中不同重要性的用途。