Neural networks have had discernible achievements in a wide range of applications. The wide-spread adoption also raises the concern of their dependability and reliability. Similar to traditional decision-making programs, neural networks can have defects that need to be repaired. The defects may cause unsafe behaviors, raise security concerns or unjust societal impacts. In this work, we address the problem of repairing a neural network for desirable properties such as fairness and the absence of backdoor. The goal is to construct a neural network that satisfies the property by (minimally) adjusting the given neural network's parameters (i.e., weights). Specifically, we propose CARE (\textbf{CA}usality-based \textbf{RE}pair), a causality-based neural network repair technique that 1) performs causality-based fault localization to identify the `guilty' neurons and 2) optimizes the parameters of the identified neurons to reduce the misbehavior. We have empirically evaluated CARE on various tasks such as backdoor removal, neural network repair for fairness and safety properties. Our experiment results show that CARE is able to repair all neural networks efficiently and effectively. For fairness repair tasks, CARE successfully improves fairness by $61.91\%$ on average. For backdoor removal tasks, CARE reduces the attack success rate from over $98\%$ to less than $1\%$. For safety property repair tasks, CARE reduces the property violation rate to less than $1\%$. Results also show that thanks to the causality-based fault localization, CARE's repair focuses on the misbehavior and preserves the accuracy of the neural networks.
翻译:广泛采用神经网络在广泛的应用中取得了明显的成就。 广泛应用也提高了人们对神经网络可靠性和可靠性的关切。 与传统的决策程序类似, 神经网络可能存在需要修复的缺陷。 缺陷可能导致不安全的行为, 引起安全关切或不公正的社会影响。 在这项工作中, 我们解决修复神经网络的问题, 以寻找合适的属性, 如公平性和缺乏后门。 目标是建立一个神经网络, 以( 最小的) 调整给定神经网络的参数( 重量) 。 具体来说, 我们建议CRE(\ textbf{ CA} ) 以内网络为基础, 神经网络可能存在缺陷, 需要修复。 基于因果关系的网络修复技术1) 以因果关系为基础, 本地化缺陷为基础, 确定“ 有罪” 神经元, 2 优化所查明的神经元参数, 以减少错误。 我们从经验角度评估了CARE( ) 以后门清除为主, 神经网络修复公正和安全性属性。 我们的实验结果显示, CREA (crealalalality) 能够有效地修复所有成本 。