Toward Explaining Large Language Models in Software Engineering Tasks

Recent progress in Large Language Models (LLMs) has substantially advanced the automation of software engineering (SE) tasks, enabling complex activities such as code generation and code summarization. However, the black-box nature of LLMs remains a major barrier to their adoption in high-stakes and safety-critical domains, where explainability and transparency are vital for trust, accountability, and effective human supervision. Despite increasing interest in explainable AI for software engineering, existing methods lack domain-specific explanations aligned with how practitioners reason about SE artifacts. To address this gap, we introduce FeatureSHAP, the first fully automated, model-agnostic explainability framework tailored to software engineering tasks. Based on Shapley values, FeatureSHAP attributes model outputs to high-level input features through systematic input perturbation and task-specific similarity comparisons, while remaining compatible with both open-source and proprietary LLMs. We evaluate FeatureSHAP on two bi-modal SE tasks: code generation and code summarization. The results show that FeatureSHAP assigns less importance to irrelevant input features and produces explanations with higher fidelity than baseline methods. A practitioner survey involving 37 participants shows that FeatureSHAP helps practitioners better interpret model outputs and make more informed decisions. Collectively, FeatureSHAP represents a meaningful step toward practical explainable AI in software engineering. FeatureSHAP is available at https://github.com/deviserlab/FeatureSHAP.
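To make the attribution idea concrete, the following is a minimal sketch of exact Shapley-value attribution over high-level prompt features. It is not the FeatureSHAP implementation. The feature names (`signature`, `docstring`, `license_comment`), the deterministic `toy_model` standing in for an LLM call, the choice of perturbation (dropping a feature entirely), and the use of `difflib.SequenceMatcher` as the similarity measure are all assumptions made for illustration. The value of a feature subset is taken to be the similarity between the output produced from that subset and the output produced from the full prompt.

```python
from difflib import SequenceMatcher
from itertools import combinations
from math import factorial

# Hypothetical high-level features of a code-generation prompt.
FEATURES = {
    "signature": "def add(a, b):",
    "docstring": '"""Return the sum of a and b."""',
    "license_comment": "# Licensed under the MIT License.",
}


def toy_model(present):
    """Hypothetical stand-in for an LLM: output depends only on which
    features are present, and it ignores the license comment."""
    lines = []
    if "signature" in present:
        lines.append("def add(a, b):")
    if "docstring" in present:
        lines.append("    return a + b")
    return "\n".join(lines)


def similarity(a, b):
    """Assumed task-specific similarity; here a plain string ratio in [0, 1]."""
    return SequenceMatcher(None, a, b).ratio()


def shapley_attributions(features, model, sim):
    """Exact Shapley values: average marginal contribution of each feature
    over all subsets of the remaining features."""
    names = list(features)
    n = len(names)
    reference = model(features)  # output for the unperturbed prompt

    def value(subset):
        # Perturbation: keep only the features in `subset`, drop the rest.
        kept = {name: features[name] for name in subset}
        return sim(model(kept), reference)

    phi = {}
    for name in names:
        others = [k for k in names if k != name]
        total = 0.0
        for size in range(n):
            # Shapley weight for coalitions of this size.
            weight = factorial(size) * factorial(n - size - 1) / factorial(n)
            for subset in combinations(others, size):
                total += weight * (value(subset + (name,)) - value(subset))
        phi[name] = total
    return phi


attributions = shapley_attributions(FEATURES, toy_model, similarity)
for name, score in attributions.items():
    print(f"{name}: {score:.3f}")
```

Because the toy model ignores `license_comment`, adding that feature never changes the output, so its attribution is zero. The attributions also sum to the value of the full prompt minus the value of the empty prompt. Exact enumeration evaluates every subset and is only practical for a handful of features; with a real LLM each evaluation is a model call, so a sampling-based approximation would normally be needed.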
