With the widespread deployment of deep neural networks (DNNs), ensuring the reliability of DNN-based systems is of great importance. Serious reliability issues such as system failures can be caused by numerical defects, one of the most frequent defects in DNNs. To assure high reliability against numerical defects, in this paper, we propose the RANUM approach including novel techniques for three reliability assurance tasks: detection of potential numerical defects, confirmation of potential-defect feasibility, and suggestion of defect fixes. To the best of our knowledge, RANUM is the first approach that confirms potential-defect feasibility with failure-exhibiting tests and suggests fixes automatically. Extensive experiments on the benchmarks of 63 real-world DNN architectures show that RANUM outperforms state-of-the-art approaches across the three reliability assurance tasks. In addition, when the RANUM-generated fixes are compared with developers' fixes on open-source projects, in 37 out of 40 cases, RANUM-generated fixes are equivalent to or even better than human fixes.
翻译:由于广泛部署深层神经网络,确保基于DNN的系统的可靠性非常重要。严重的可靠性问题,如系统失灵可能是数字缺陷造成的,这是DNN最经常出现的缺陷之一。为了保证对数字缺陷的高度可靠性,我们在本文件中建议使用RANUM方法,包括三项可靠性保证任务的新技术:发现潜在数字缺陷、确认潜在缺陷的可行性以及提出缺陷纠正建议。据我们所知,RANUM是第一个在抑制故障的测试中确认潜在缺陷可行性并自动提出修正建议的方法。关于63个真实世界DNNN的建筑基准的广泛实验表明,RANUM在三项可靠性保证任务中超越了最先进的方法。此外,当RANUM生成的修复方法与开发商关于开源项目的固定方法相比较时,在40个案例中,在37个案例中,RANUM产生的固定方法相当于甚至比人类的固定方法更好。