In recent years, deep learning (DL) models have achieved significant progress in many domains, such as autonomous driving, facial recognition, and speech recognition. However, the vulnerability of deep learning models to adversarial attacks has raised serious concerns in the community because of their insufficient robustness and generalization. Also, transferable attacks have become a prominent method for black-box attacks. In this work, we explore the potential factors that impact adversarial examples (AEs) transferability in DL-based speech recognition. We also discuss the vulnerability of different DL systems and the irregular nature of decision boundaries. Our results show a remarkable difference in the transferability of AEs between speech and images, with the data relevance being low in images but opposite in speech recognition. Motivated by dropout-based ensemble approaches, we propose random gradient ensembles and dynamic gradient-weighted ensembles, and we evaluate the impact of ensembles on the transferability of AEs. The results show that the AEs created by both approaches are valid for transfer to the black box API.
翻译:近年来,深度学习(DL)模型在自动驾驶、面部识别和语音识别等许多领域取得了显著进展。然而,深度学习模型对于对抗性攻击的脆弱性引起了社区的严重关注,因为它们的鲁棒性和泛化性不足。此外,可转移攻击已成为黑盒攻击的主要方法。在这项工作中,我们探讨了影响DL基础语音识别中对抗性示例(AE)可转移性的潜在因素。我们还讨论了不同DL系统的易受攻击性以及决策边界的不规则性。我们的结果展示了AE在语音和图像之间转移性的显著差异,其中图像的数据相关性较低,但语音识别则相反。受去除法的集成方法的启发,我们提出了随机梯度集成和动态梯度加权集成,并评估了集成对AE可转移性的影响。结果表明,这两种方法创建的AE对于黑盒API是有效的。