Fairness of machine learning (ML) software has become a major concern in the recent past. Although recent research on testing and improving fairness have demonstrated impact on real-world software, providing fairness guarantee in practice is still lacking. Certification of ML models is challenging because of the complex decision-making process of the models. In this paper, we proposed Fairify, an SMT-based approach to verify individual fairness property in neural network (NN) models. Individual fairness ensures that any two similar individuals get similar treatment irrespective of their protected attributes e.g., race, sex, age. Verifying this fairness property is hard because of the global checking and non-linear computation nodes in NN. We proposed sound approach to make individual fairness verification tractable for the developers. The key idea is that many neurons in the NN always remain inactive when a smaller part of the input domain is considered. So, Fairify leverages whitebox access to the models in production and then apply formal analysis based pruning. Our approach adopts input partitioning and then prunes the NN for each partition to provide fairness certification or counterexample. We leveraged interval arithmetic and activation heuristic of the neurons to perform the pruning as necessary. We evaluated Fairify on 25 real-world neural networks collected from four different sources, and demonstrated the effectiveness, scalability and performance over baseline and closely related work. Fairify is also configurable based on the domain and size of the NN. Our novel formulation of the problem can answer targeted verification queries with relaxations and counterexamples, which have practical implications.
翻译:虽然最近关于测试和改进公平性的研究对现实世界软件产生了明显的影响,但实际中仍然缺乏公平保障。由于模型决策过程复杂,对模型的认证具有挑战性。在本文中,我们提议公平性,即基于SMT的办法来核查神经网络模型的个人公平性财产。个人公平性确保任何两个类似的个人都得到类似的待遇,而不论其受保护的属性,例如种族、性别、年龄。由于全球检查和非线性计算节点对NNN产生的影响,核实这种公平性财产是困难的。我们提出了使个人公平性核查便于开发者使用的正确方法。关键的想法是,在考虑输入领域较小部分时,NNFeral性模型的许多神经性分子总是不活跃。因此,公平性地利用生产模型的白箱访问,然后根据分类进行正式分析。我们的方法采用输入平衡,然后对每个分区的NNW进行简化,以便提供公平性认证或反直线性计算。我们用真实的准确性估算和激活了神经网络的准确性能,我们用真实性估算和直径网络的四大范围进行对比。