Feature attribution is widely used in interpretable machine learning to explain how influential each measured input feature value is for a model's output. However, measurements can be uncertain, and it is unclear how awareness of input uncertainty affects trust in explanations. We propose and study two approaches to help users manage their perception of uncertainty in a model explanation: 1) transparently show uncertainty in feature attributions so that users can reflect on it, and 2) suppress attribution to features with uncertain measurements, shifting attribution to other features, by regularizing with an uncertainty penalty. Through simulation experiments, qualitative interviews, and quantitative user evaluations, we identified the benefits of moderately suppressing attribution uncertainty and concerns about showing attribution uncertainty. This work adds to the understanding of handling and communicating uncertainty for model interpretability.
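To make the second approach concrete, the sketch below illustrates one plausible form of an uncertainty penalty: augmenting the task loss with a term that penalizes input-gradient attribution mass on features with high measurement uncertainty, so training shifts reliance toward more reliably measured features. This is a minimal illustration under stated assumptions, not the paper's implementation; the attribution choice (input gradients), the known per-feature uncertainty vector `feature_uncertainty`, and the weight `lam` are all illustrative assumptions.

```python
# Minimal sketch (assumed, not the paper's method) of suppressing
# attribution to uncertain features via an uncertainty penalty.
import torch
import torch.nn as nn

torch.manual_seed(0)

n_features = 5
model = nn.Sequential(nn.Linear(n_features, 16), nn.ReLU(), nn.Linear(16, 1))

# Per-feature measurement uncertainty (higher = less reliable);
# assumed known a priori for this illustration.
feature_uncertainty = torch.tensor([0.0, 0.0, 0.0, 1.0, 1.0])
lam = 0.1  # penalty strength; "moderate suppression" corresponds to a moderate lam

x = torch.randn(32, n_features, requires_grad=True)  # toy batch
y = torch.randn(32, 1)

pred = model(x)
task_loss = nn.functional.mse_loss(pred, y)

# Input-gradient attributions (one simple attribution method among many).
grads = torch.autograd.grad(pred.sum(), x, create_graph=True)[0]

# Penalize attribution on uncertain features; the model is thereby
# encouraged to shift attribution to other (more certain) features.
penalty = (feature_uncertainty * grads.abs()).mean()

loss = task_loss + lam * penalty
loss.backward()  # parameter gradients now discourage reliance on uncertain features
```

In this framing, `lam = 0` recovers ordinary training, and increasing `lam` trades task fit for reduced attribution to uncertain inputs, mirroring the moderate-suppression regime the evaluations found beneficial.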