可解释人工智能中的隐私风险与保护方法：一项范围综述 (Privacy Risks and Preservation Methods in Explainable Artificial Intelligence: A Scoping Review)

Explainable Artificial Intelligence (XAI) has emerged as a pillar of Trustworthy AI and aims to bring transparency in complex models that are opaque by nature. Despite the benefits of incorporating explanations in models, an urgent need is found in addressing the privacy concerns of providing this additional information to end users. In this article, we conduct a scoping review of existing literature to elicit details on the conflict between privacy and explainability. Using the standard methodology for scoping review, we extracted 57 articles from 1,943 studies published from January 2019 to December 2024. The review addresses 3 research questions to present readers with more understanding of the topic: (1) what are the privacy risks of releasing explanations in AI systems? (2) what current methods have researchers employed to achieve privacy preservation in XAI systems? (3) what constitutes a privacy preserving explanation? Based on the knowledge synthesized from the selected studies, we categorize the privacy risks and preservation methods in XAI and propose the characteristics of privacy preserving explanations to aid researchers and practitioners in understanding the requirements of XAI that is privacy compliant. Lastly, we identify the challenges in balancing privacy with other system desiderata and provide recommendations for achieving privacy preserving XAI. We expect that this review will shed light on the complex relationship of privacy and explainability, both being the fundamental principles of Trustworthy AI.

翻译：可解释人工智能（XAI）已成为可信人工智能的支柱，其目标是为本质不透明的复杂模型带来透明度。尽管在模型中引入解释具有诸多益处，但向最终用户提供此类额外信息所引发的隐私关切亟待解决。本文通过对现有文献进行范围综述，以揭示隐私与可解释性之间冲突的具体细节。采用标准的范围综述方法，我们从2019年1月至2024年12月发表的1,943项研究中提取了57篇文献。本综述通过回答三个研究问题，为读者提供对该主题更深入的理解：（1）在AI系统中发布解释存在哪些隐私风险？（2）研究者目前已采用哪些方法来实现XAI系统中的隐私保护？（3）何为隐私保护型解释？基于从所选研究中综合的知识，我们对XAI中的隐私风险与保护方法进行了分类，并提出了隐私保护型解释应具备的特征，以帮助研究者和实践者理解符合隐私要求的XAI所需满足的条件。最后，我们指出了在隐私与其他系统期望目标之间取得平衡所面临的挑战，并提供了实现隐私保护型XAI的建议。我们期望本综述能阐明隐私与可解释性——二者同为可信人工智能基本原则——之间的复杂关系。