Many tasks in communication systems, such as modulation recognition, rely on Deep Neural Network (DNN) models. However, these models have been shown to be susceptible to adversarial perturbations, namely imperceptible additive noise crafted to induce misclassification. This raises questions not only about security but also about the general trust in model predictions. We propose to use adversarial training, which consists of fine-tuning the model with adversarial perturbations, to increase the robustness of automatic modulation recognition (AMC) models. We show that current state-of-the-art models benefit from adversarial training, which mitigates the robustness issues for some families of modulations. We use adversarial perturbations to visualize the learned features, and we find that in robust models the signal symbols are shifted towards the nearest classes in constellation space, similarly to maximum likelihood methods. This confirms that robust models are not only more secure, but also more interpretable, building their decisions on signal statistics that are relevant to modulation recognition.
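To make the adversarial fine-tuning procedure concrete, the following is a minimal sketch of how such training is commonly implemented. It assumes PyTorch, a pretrained AMC classifier operating on I/Q signal tensors, and an L-infinity PGD attack; all names, shapes, and hyperparameters (eps, alpha, steps) are illustrative assumptions, not the exact configuration used in this work.

```python
import torch
import torch.nn.functional as F

def pgd_perturb(model, x, y, eps=0.01, alpha=0.0025, steps=7):
    """Craft an additive perturbation delta with ||delta||_inf <= eps (illustrative values)."""
    delta = torch.zeros_like(x, requires_grad=True)
    for _ in range(steps):
        loss = F.cross_entropy(model(x + delta), y)
        loss.backward()
        with torch.no_grad():
            delta += alpha * delta.grad.sign()  # gradient-ascent step on the loss
            delta.clamp_(-eps, eps)             # project back onto the eps-ball
        delta.grad.zero_()
    return delta.detach()

def adversarial_finetune(model, loader, epochs=5, lr=1e-4):
    """Fine-tune a pretrained model on adversarially perturbed batches."""
    opt = torch.optim.Adam(model.parameters(), lr=lr)
    model.train()
    for _ in range(epochs):
        for x, y in loader:              # x: I/Q signals, y: modulation labels
            delta = pgd_perturb(model, x, y)
            opt.zero_grad()
            loss = F.cross_entropy(model(x + delta), y)
            loss.backward()
            opt.step()
    return model
```

In this hypothetical setup, `loader` yields batches of received signals and their modulation labels; the same perturbation routine can also be used at evaluation time to visualize how robust models shift symbols in constellation space.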