关于大赦国际黑盒解释的关键经验研究 (Critical Empirical Study on Black-box Explanations in AI)

from arxiv, International Conference on Information Systems 2021, Dec 2021, JW Marriott Austin, United States. arXiv admin note: text overlap with arXiv:2109.09586

This paper provides empirical concerns about post-hoc explanations of black-box ML models, one of the major trends in AI explainability (XAI), by showing its lack of interpretability and societal consequences. Using a representative consumer panel to test our assumptions, we report three main findings. First, we show that post-hoc explanations of black-box model tend to give partial and biased information on the underlying mechanism of the algorithm and can be subject to manipulation or information withholding by diverting users' attention. Secondly, we show the importance of tested behavioral indicators, in addition to self-reported perceived indicators, to provide a more comprehensive view of the dimensions of interpretability. This paper contributes to shedding new light on the actual theoretical debate between intrinsically transparent AI models and post-hoc explanations of black-box complex models-a debate which is likely to play a highly influential role in the future development and operationalization of AI systems.

翻译：本文对黑盒 ML 模型的热后解释提出了经验上的关切,这是AI解释性的主要趋势之一,表明其缺乏可解释性和社会后果。我们用一个有代表性的消费者小组来测试我们的假设,我们报告了三个主要结论。首先,我们表明,黑盒模型的热后解释往往对算法的基本机制产生偏颇和偏颇的信息,并可能通过转移用户的注意力而加以操纵或隐瞒信息。第二,我们表明,除了自我报告的可识别指标外,测试的行为指标对于更全面地了解可解释性层面的重要性。本文有助于重新阐明在本质上透明的AI 模型与黑盒复杂模型的事后解释之间的实际理论辩论,这种辩论有可能对AI系统的未来开发和运行产生很大影响。

相关内容

黑盒

关注 1

在科学，计算和工程学中，黑盒是一种设备，系统或对象，可以根据其输入和输出（或传输特性）对其进行查看，而无需对其内部工作有任何了解。它的实现是“不透明的”（黑色）。几乎任何事物都可以被称为黑盒：晶体管，引擎，算法，人脑，机构或政府。为了使用典型的“黑匣子方法”来分析建模为开放系统的事物，仅考虑刺激/响应的行为，以推断（未知）盒子。该黑匣子系统的通常表示形式是在该方框中居中的数据流程图。黑盒的对立面是一个内部组件或逻辑可用于检查的系统，通常将其称为白盒（有时也称为“透明盒”或“玻璃盒”）。

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日

回顾机器学习公平的数学框架，Review of Mathematical frameworks for Fairness in Machine Learning

专知会员服务

38+阅读 · 2020年5月30日

可解释强化学习，Explainable Reinforcement Learning: A Survey

专知会员服务

131+阅读 · 2020年5月14日

因果图，Causal Graphs，52页ppt

专知会员服务

250+阅读 · 2020年4月19日