可解释的AI能够解释不公平吗? 评估可解释的AI的框架。 (Can Explainable AI Explain Unfairness? A Framework for Evaluating Explainable AI)

Many ML models are opaque to humans, producing decisions too complex for humans to easily understand. In response, explainable artificial intelligence (XAI) tools that analyze the inner workings of a model have been created. Despite these tools' strength in translating model behavior, critiques have raised concerns about the impact of XAI tools as a tool for `fairwashing` by misleading users into trusting biased or incorrect models. In this paper, we created a framework for evaluating explainable AI tools with respect to their capabilities for detecting and addressing issues of bias and fairness as well as their capacity to communicate these results to their users clearly. We found that despite their capabilities in simplifying and explaining model behavior, many prominent XAI tools lack features that could be critical in detecting bias. Developers can use our framework to suggest modifications needed in their toolkits to reduce issues likes fairwashing.

翻译：许多ML模型对人类而言不透明,产生对人类而言过于复杂的决定,使人难以理解。作为回应,已经创建了可解释的人工智能(XAI)工具来分析模型的内部功能。尽管这些工具在翻译模型行为方面的实力,但批评引起了人们对XAI工具的影响的关切,这种工具是误导用户的“公平洗刷”工具,使其相信有偏见或不正确的模型。在这份文件中,我们建立了一个框架,用于评价可解释的AI工具,这些工具能够发现和解决偏见和公平问题,以及它们向用户明确传达这些结果的能力。我们发现,尽管它们有能力简化和解释模型行为,但许多突出的XAI工具缺乏在发现偏见方面至关重要的特点。开发者可以利用我们的框架,建议对其工具进行所需的修改,以减少像公平洗钱这样的问题。

相关内容

TOOLS

关注 1

这个新版本的工具会议系列恢复了从1989年到2012年的50个会议的传统。工具最初是“面向对象语言和系统的技术”，后来发展到包括软件技术的所有创新方面。今天许多最重要的软件概念都是在这里首次引入的。2019年TOOLS 50+1在俄罗斯喀山附近举行，以同样的创新精神、对所有与软件相关的事物的热情、科学稳健性和行业适用性的结合以及欢迎该领域所有趋势和社区的开放态度，延续了该系列。官网链接：http://tools2019.innopolis.ru/

【2021干货书】Python可解释人工智能，207页pdf，Explainable AI with Python

专知会员服务

186+阅读 · 2021年5月17日

【机器推理可解释性】Machine Reasoning Explainability

专知会员服务

35+阅读 · 2020年9月3日

Linux导论，Introduction to Linux，96页ppt

专知会员服务

81+阅读 · 2020年7月26日