Recently, adversarial machine learning attacks have posed serious security threats against practical audio signal classification systems, including speech recognition, speaker recognition, and music copyright detection. Previous studies have mainly focused on ensuring the effectiveness of attacking an audio signal classifier via creating a small noise-like perturbation on the original signal. It is still unclear if an attacker is able to create audio signal perturbations that can be well perceived by human beings in addition to its attack effectiveness. This is particularly important for music signals as they are carefully crafted with human-enjoyable audio characteristics. In this work, we formulate the adversarial attack against music signals as a new perception-aware attack framework, which integrates human study into adversarial attack design. Specifically, we conduct a human study to quantify the human perception with respect to a change of a music signal. We invite human participants to rate their perceived deviation based on pairs of original and perturbed music signals, and reverse-engineer the human perception process by regression analysis to predict the human-perceived deviation given a perturbed signal. The perception-aware attack is then formulated as an optimization problem that finds an optimal perturbation signal to minimize the prediction of perceived deviation from the regressed human perception model. We use the perception-aware framework to design a realistic adversarial music attack against YouTube's copyright detector. Experiments show that the perception-aware attack produces adversarial music with significantly better perceptual quality than prior work.
翻译:最近,对抗性机器学习攻击对实用的音频信号分类系统造成严重的安全威胁,包括语音识别、语音识别和音乐版权探测等。以前的研究主要侧重于确保通过对原始信号进行小的噪音式扰动来攻击音频信号分类器的有效性。目前还不清楚攻击者是否能够制造出除了其攻击效果外人类还能清楚地察觉到的音频信号扰动。这对于音乐信号特别重要,因为它们是精心设计的,具有人类可喜的音频特征。在这项工作中,我们将针对音乐信号的对抗性攻击作为新的感知质量攻击框架,将人类研究纳入对抗性攻击设计。具体地说,我们进行人类研究,以量化人类对音乐信号变化的认识。我们请人类参与者根据原始和受扰动的音乐信号组合来评价其感知的偏差,并通过回归分析来反向增强人类感知觉过程,以预测人类对立性攻击的偏差,而不是以受到干扰的音频信号信号。然后将认知性攻击发展成一种最优化的模拟性攻击性框架。我们用一种最优的感化的音乐感预感测测测到人类的图像。我们的感测测测测测测测测到人类的图像。